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Abstract 

This  report  represents  the  fifth  year  of  research  performed  under  the 
auspices  of  the  Joint  Services  Electronics  Program  at  Texas  Tech  University. 
The  program  is  concentrated  in  the  "information  electronics"  area  and  in¬ 
cludes  researchers  from  both  the  departments  of  Electrical  Engineering  and 
Mathematics.  Specific  work  units  deal  with  Feedback  System  Design,  Non¬ 
linear  Control,  Nonlinear  Fault  Analysis,  Image  Processing,  and  Pointing 
and  Tracking. 

Each  work  unit  is  represented  in  the  report  by  a  summary  of  the  work 
performed  during  the  past  year,  a  list  of  publications  and  activities  in  the 
area,  reprints  of  all  papers  which  have  been  published  during  the  past  year, 
and  abstracts  of  pending  papers.  In  addition,  the  report  includes  lists 
of  all  grants  and  contracts  administered  by  JSEP  personnel,  the  department 
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Significant  Accomplishments  Report 
A.  Nonlinear  Control 


Ouring  the  past  year  Professor  L.R.  Hunt  and  his  students  working 
jointly  with  researchers  at  NASA/ AMES  have  developed  an  entirely  new 
approach  to  nonlinear  control  system  design  problem.  Tiie  key  to  the 
new  approach  is  the  formulation  of  an  exact  linearization  theory  which 
permits  the  nonlinear  system  under  investigation  to  be  transformed, 
without  approximation,  into  an  equivalent  linear  system  to  which  classical 
design  methodologies  are  applicable.  This  has,  in  turn,  been  achieved 
through  the  application  of  the  powerful  techniques  of  modern  differential 
geometry  through  which  it  has  been  possible  to  convert  the  exact  lin¬ 
earization  concept  from  a  theoretical  abstraction  into  a  viable  design 
algorithm.  Indeed,  the  algorithm  has  already  been  implemented  at  NASA/ 
AMES  in  the  design  of  a  helicopter  autopilot. 

Although  the  exact  linearization  concept  goes  back  to  Poincare  and 
has  been  investigated  by  a  number  of  researchers  during  the  past  decade 
the  class  of  transformations  employed  in  this  work  has  typically  been 
limited  to  those  which  could  be  implemented  by  feedback.  Unfortunately, 
this  class  of  transformations  has  not  proven  to  be  amenable  to  mathematical 
analysis.  To  the  contrary  by  adopting  a  larger  class  of  transformations 
originally  proposed  by  R.  Su,  Professor  Hunt  has  been  able  to  formulate  a 
complete  theory  around  the  exact  linearization  concept.  This  includes 
precise  necessary  and  sufficient  conditions  for  a  plant  to  admit  an  exact 
linearization  and  a  partial  differential  equation  whose  solution  defines 
the  appropriate  transformation  with  which  to  linearize  a  given  system. 
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Although  Or.  Hunt's  research  is  extremely  theoretical  in  nature 
his  research  has  been  closely  coordinated  with  the  autopilot  design 
program  at  NASA/AMES.  Indeed,  AMES  has  already  employed  his  work  in  the 
design  of  an  experimental  helicopter  autopilot  which  is  presently  under- 
going  simulation  and  is  expected  to  fly  in  the  near  future. 

B.  Nonlinear  Fault  Analysis 

During  the  past  year  we  completed  work  (we  believe  successfully)  on  a 
long  standing  JSEP  work  unit  directed  at  the  development  of  an  algorithm 
for  the  solution  of  the  analog  fault  diagnosis  problem.  Although  many 
algorithms  have  been  proposed  over  the  years  which  were  theoretically 
capable  of  determining  the  fault  circuit  components  from  external  measure¬ 
ments  the  problem  has  been  to  find  an  algorithm  which  could  do  the  job  with 
limited  computational  resources  and  constraints  on  the  number  of  measurements 
which  can  be  made  on  the  unit  under  test. 

These  problems  have  now  been  resolved  via  a  new  self- testing  algorithm 
developed  by  Professor  Saeks  and  his  students.  In  essence,  the  algorithm 
exploits  the  fact  that  all  of  the  system  components  do  not  fail  simultaneous¬ 
ly  thereby  permitting  one  to  use  a  subset  of  the  system  components  to  test 
the  remaining  components.  This,  in  turn,  yields  conditional  test  informa¬ 
tion  which  is  valid  if  the  given  subset  of  components  are,  in  fact,  good. 

The  results  of  several  such  conditional  tests  are  then  combined  with  the 
aid  of  an  upper  bound  on  the  number  of  simultaneous  failures  to  obtain  the 
final  diagnosis.  Although  the  procedure  is  somewhat  roundabout  it  meets 
most  of  the  criteria  which  have  been  established  over  the  years  by 
researchers  in  the  analog  fault  diagnosis  area. 

i).  It  is  applicable  to  both  linear  and  nonlinear  systems  modeled 
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In  either  the  time  or  frequency  domain. 
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ii).  It  can  be  used  to  locate  multiple  hard  or  soft  faults, 
iii).  Finally,'  it  is  capable  of  locating  failures  in  "replaceable 
Modules"  such  as  an  IC  chip,  a  PC  board,  or  a ^subsystem  rather 
than  discrete  components. 

As  of  the  present  time  we  have  completed  the  algorithm  development 
work  related  to  the  new  algorithm  while  we  are  presently  doing  a  pre¬ 
liminary  investigation  of  the  software  engineering  problems  which  must 
be  resolved  as  a  prerequisite  to  implementing  the  algorithm  in  a  user 
oriented  software  code. 


T exas  Tech  University 
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Research  Unit:  1 


1.  Title  of  Investigation:  Feedback  System  Design 

2.  Senior  Investigator:  Richard  Saeks  Telephone:  (806^742-3528 

3.  JSEP  Funds:  $25,875 

4.  Other  Funds: 

5.  Total  Number  of  Professionals:  PI  1  (1  mo.)  RA  1  (1/2  time) _ 

6.  Summary: 

Although  control  theorists  have  studied  higher  order  and  multivariate 
systems  for  more  than  a  quarter  of  a  century  this  research  has  had  little 
impact  on  the  DOD  community  in  which  single  loop  PI  designs  still  predominate. 
Indeed,  such  controllers  represent  the  physical  limit  of  what  can  be  achieved 
with  the  hydraulic  and/or  analog  electronic  hardware  which  is  traditionally 
used  to  implement  a  control  system.  With  the  advent  of  the  digital  control 
computer,  however,  higher  order  and  multivariate  controllers  have  become  a 
reality,  wherein,  one  can  implement  any  desired  compensator  design  simply 
by  burning  the  appropriate  program  into  a  ROM. 

Given  the  renewed  interest  in  higher  order  multivariate  control  brought 
about  by  the  digital  control  computer  the  present  work  unit  is  directed  toward 
the  problem  of  developing  an  efficient  algorithmic  design  procedure  for  linear 
multivariate  control  systems  using  frequency  domain  techniques.  Our  approach 
is  based  on  a,  now  classical,  result  of  Youla,  Bongiorno  and  Jabr  in  which  an 
explicit  parameterization  of  the  set  of  compensators  which  stabilize  a  given 
plant  is  formulated.  Indeed,  with  the  aid  of  a  modified  parameterization 
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due  to  Desoer,  Liu,  Murray,  and  the  author  one  can  parameterize  the  set 
of  compensators  for  a  given  plant  in  such  a  manner  that  the  resultant  feed¬ 
back  system  gains  are  linear  (actually  affine)  in  the  design  parameter. 

This,  in  turn  greatly  simplifies  the  process  of  specializing  the  design  to 
meet  additional  constraints  and,  as  such,  a  powerful  design  theory  has  been 
obtained  which  includes  the  txacklng  and  dvstuxbance  xe.jzcJU.on  pxoblems,  the 
pole  placement  pxoblem,  xobust  design,  design  with  stable  ox  pxopex  compen- 
satoxs,  and  the  model  matching  pxoblem  as  well  as  the  beginnings  of  a 
simultaneous  design  and  adaptive  contxol  tkeoxy. 
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I.  Introduction 

Hr  HE  FEEDBACK  system  design  problem  may  natu- 
*  rally  be  subdivided  into  two  tasks: 

1)  satisfaction  of  design  constraints,  and 

2)  “optimization’*  of  system  performance. 

The  first  and  foremost  design  constraint  is  stability. 
Additionally,  system  specifications  may  also  include  a 
tracking  and/or  disturbance  rejection  constraint  Once 
them  constraints  have  been  satisfied,  the  remaining  design 
latitude  may  then  be  used  to  “optimize”  the  qualitative 
performance  of  the  system:  cost  energy  consumption, 
overshoot  reliability,  complexity,  etc. 

The  purpose  of  the  present  paper  is  to  report  on  the 
results  of  a  research  program  whose  goal  is  the  formula* 
tion  of  a  new  algebraic  fractional  representation  approach 
to  the  feedback  system  design  problem.  The  key  to  this 
approach  is  a  design  philosophy  pioneered  by  D.  C 
Youla,  in  which  one  parameterizes  the  entire  solution 
space  for  the  design  problem,  rather  than  simply  con¬ 
structing  a  single  solution  f  15]— [17].  The  approach  is  ide¬ 
ally  suited  for  the  feedback  system  design  problem, 
wherein  one  can  satisfy  several  design  constraints  by 
sequentially  reparameterizing  the  controller  as  additional 
constraints  are  imposed  with  the  system  performance  being 
“optimized”  over  the  final  parameterization.  In  [15]  and 
[16]  Youla,  Bongiomo,  and  Jabr  parameterized  the  stabi¬ 
lizing  controllers  for  single-variate  and  multivariate  feed¬ 
back  systems,  respectively,  and  showed  that  the  optimiza¬ 
tion  of  the  system  performance  over  the  resultant 
parameteer  space  reduced  to  a  standard  Wiener-Hopf 
problem.  More  recently  Youla  has  adopted  a  similar  ap¬ 
proach  to  the  design  of  stochastic  estimators  in  which  the 
set  of  covariances  which  interpolate  one’s  observations  of 
a  stochastic  process  are  parameterized  [17]. 

Manucript  rec*Tv«d  January  2,  1980;  rtvUad  October  8,  1980.  This 
week  was  supported  in  past  by  the  Joint  Services  Electronics  Profnun  at 
Texas  Technical  University  under  ONR  Contract  76-C-l  13d. 

The  authors  are  with  the  Department  of  Electrical  Fniinaarins,  Texas 
Technical  University,  Lubbock,  TX  79409. 


The  present  research  program  began  with  the  derivation 
of  a  simple  algebraic  proof  for  the  Youla/Bongiomo/Jabr 
stabilization  theory  and  the  generalization  of  the  theory  to 
include  a  variety  of  “system  placement”  problems  in  a 
general  linear  system  setting  [6].  The  spirit  of  this  work 
was  very  dose  to  that  of  several  other  authors  in  the  area 
of  “algebraic”  control  theory  [I],  [2],  [4J,  [7],  [12],  [13]. 
Among  the  problems  which  have  been  studied  by  these 
authors  are  those  of  tracking  and  disturbance  rejection.  In 
the  present  paper,  we  study  these  problems  in  a  general 
axiomatic  setting.  Two  restrictions  are  made,  however. 
First,  we  assume  that  the  disturbances  affect  the  system  in 
a  very  simple  way,  and  second,  we  assume  that  the  de¬ 
nominators  of  the  reference  generators  commute  with 
everything.  (See  Section  V  for  a  more  detailed  explana¬ 
tion.)  With  this  simplification,  it  is  again  possible  to 
obtain  a  complete  parameterization  of  all  controllers  which 
achieve  the  prescribed  design  contraints  by  sequentially 
reparameterizing  the  controllers  as  additional  design  con¬ 
straints  are  imposed.  Moreover,  the  feedback  system  gains 
which  result  from  such  a  controller  are  linear  (affine)  in 
the  resultant  design  parameter,  thereby  simplifying  the 
second  task  referred  to  above,  namely  optimization. 

Throughout  the  paper  we  will  work  with  the  feedback 
system  of  Fig.'  1.  Here,  P  is  a.  given  plant  and  C  is  the 
compensator  to  be  designed  while  T  and  R  are  reference 
generators  that  model  the  inputs  to  be  tracked  and  re¬ 
jected,  respectively.  (The  sense  in  which  these  generators 
model  inputs  will  be  made  precise  in  Section  III.)  The 
problem  of  parameterizing  the  compensators  that  stabilize 
the  feedback  system,  termed  problem  S,  was  the  subject  of 
[6]  and  will  serve  as  the  starting  point  for  the  present 
paper  in  which  we  investigate  three  additional  design 
problems: 

Problem  ST:  Parameterize  the  compensators  that 
simultaneously  stabilize  the  system  and  cause  it  to  track 
the  response  of  T. 

Problem  SR:  Parameterize  the  compensators  that 
simultaneously  stabilize  the  system  and  cause  it  to  reject 
the  response  of  R. 

H  ~ 
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Problem  STR :  Parameterize  the  compensators  that 
simultaneously  stabilize  the  system,  cause  it  to  track  the 
response  of  T,  and  reject  the  response  of  R. 

Of  course,  the  concepts  of  stability,  tracking,  and  dis¬ 
turbance  rejection  will  be  made  precise  as  required. 

This,  of  course,  is  by  no  means  the  most  general  possi¬ 
ble  system.  In  particular,  the  disturbance  is  assumed 
merely  to  be  added  to  the  input,  and  the  measured  output 
is  assumed  to  be  the  same  as  the  regulated  output  We 
remark  in  passing  that  the  problem  of  rejecting  an  addi¬ 
tive  disturbance  at  the  output  is  mathematically  equiva¬ 
lent  to  the  tracking  problem. 

In  the  remainder  of  this  introduction  our  algebraic 
fractional  representation  theory  is  reviewed  and  the  re¬ 
sults  on  the  stabilization  problem,  derived  in  [6],  are 
summarized.  In  Section  II  a  more  powerful  “doubly 
coprime”  fractional  representation  theory  is  formulated 
and  its  properties  developed.  In  the  following  section  the 
concepts  of  asymptotic  tracking  and  disturbance  rejection 
are  defined  in  our  algebraic  setting  and  theorems  char¬ 
acterizing  the  stable  feedback  systems  which  track  the 
response  of  T  and/or  reject  the  response  of  if  are  ob¬ 
tained.  The  fundamental  feedback  system  design  equa¬ 
tions  are  derived  in  Section  IV,  wherein  it  is  shown  that 
problems  ST,  SR,  and  STR  each  reduce  to  the  solution  of 
a  linear  equation  in  the  ring  of  stable  operators.  A  neces¬ 
sary  and  sufficient  condition  for  the  existence  of  a  solu¬ 
tion  of  these  equations  and  the  desired  parameterization 
of  the  solution  space  are  obtained  in  a  “generalized  multi¬ 
variate”  class  of  systems  in  Section  V.  Finally,  some 
examples  of  the  theory  are  presented  in  Section  VI. 

Our  algebraic  theory  [6]  is  set  in  a  nest  of  rings,  groups, 
and  multiplicative  structures 

GdHoIz>J. 

Here,  G  is  a  ring  with  identity  that  represents  the  general 
class  of  systems  with  which  we  wish  to  work:  rational 
matrices,  continuous  operators,  a  class  of  transcendental 
functions,  etc.  H  is  a  subring  of  G  containing  the  identity 
that  models  the  systems,  which  are  stable  in  some  sense: 
poles  in  a  prescribed  region,  transcendental  functions  with 
restricted  singularities,  stable  operators,  etc.  Finally,  I 
denotes  the  multiplicative  set  composed  of  elements  of  H 
that  admit  an  inverse  in  G,  while  J  denotes  the  multiplica¬ 
tive  subgroup  of  /  made  up  of  elements  that  are  invertible 
in  H.  Detailed  examples  of  the  axiomatic  structure, 
{<7,  H,  I,  J)  were  given  in  [6]  and  will  not  be  repeated 
here. 

We  say  that  a  plant  P  has  a  right  fractional  representa¬ 
tion  in  {G,  H,1,J)  if 

P-P,Pr~'  (M) 

where  p,SH  and  p.  Si.  Furthermore,  we  say  that  this 
representation  is  right  coprime  if  there  exists  q,  and  in  H 
such  that 

q,Pr+M,- 1.  (1.2) 


This  equality  is  equivalent  to  the  classical  coprimeness 
concept  for  rational  functions  and  matrices  while  being 
well  defined  in  our  general  ring  theoretic  setting.  In  par¬ 
ticular,  if  G  is  the  ring  of  rational  functions  and  H  is  the 
ring  of  polynomials  (e.g.,  in  discrete-time  systems),  (1.2) 
implies  that  p,  and  /,  have  no  common  zeros.  If  G  is  the 
ring  of  rational  functions  and  H  is  the  ring  of  exponen¬ 
tially  stable  rational  functions,  (1.2)  implies  that  pr  and  p, 
have  no  common  right-half  plane  zeros. 

Since  the  ring  G  is,  in  general,  noncommutative,  we  also 
define  a  left  fractional  representation  for  P  via  the  equality 

P-p}'p,  (1-3) 

for  />,£//  and  p,  6/.  Furthermore,  we  say  that  this  repre¬ 
sentation  is  left  coprime  if  there  exists  q,  and  q,inH  such 
that 

Pt4t  +p/ii  “  1.  (1.4) 

Of  course,  in  the  classical  case  of  a  rational  function  or 
matrix  these  fractional  representations  are  assured  to  ex¬ 
ist.  However,  this  is  not  the  case  in  the  general  ring 
theoretic  setting.  Therefore,  for  distributed,  time-varying 
and  multidimensional  systems,  we  assume  that  our  plant 
admits  such  a  representation  as  a  prerequisite  to  the 
theory. 

If  one  is  given  two  fractional  representations  for  a 
plant,  say  P~Xrx~l  and  where  the  second 

satisfies  die  coprimeness  condition  of  (12)  then  the  two 
representations  differ  by  a  greatest  right  dhoisor  (6],  r&H, 


Le, 

*rmPr r 

(13) 

and 

*rmPS- 

(1.6) 

To  give  an  idea  of  the  arguments  used  later,  we  prove  this 
fact  as  follows.  Since  pr  €/  the  unique  solution  of  (1.5) 
and  (1.6)  is  r—p~'xr  and  hence  we  must  show  that  this 
r^H.  To  this  end  we  invoke  (12)  obtaining 

rwpr'x, -(q,Pr+qrP,)Pr~% 

m1rPrPr~%  +<M,  +4r*r 

which  implies  that  r&H  since  it  is  expressed  as  the  sum  of 
products  of  elements  of  H.  Similarly,  if  P~pfxPi 
where  pf  'pi  is  left  coprime,  then  the  two  left  fractional 
representations  differ  by  a  greatest  /e/r- divisor  [6]  ISH 
such  that 

(1.8) 

and 

*,-£/•  (1.9) 

Thus,  our  abstract  fractional  representation  theory  is  quite 
similar  to  the  classical  theory  for  polynomials  and  rational 
functions  even  though  it  is  set  in  an  abstract  ring  and 
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includes  distributed,  time-varying,  and  multidimensional 
systems. 

Since  H  represents  a  ring  of  stable  systems  we  say  that 
the  feedback  system  of  Fig.  1  is  stable  if  all  of  its  internal1 
and  external  gains  are  elements  of  H.  By  using  the  above 
described  fractional  representation  theory  a  solution  to 
problem  5  was  obtained  in  [6].  In  particular,  if  the  plant  P 
is  given  by  P~pf  lPi>  then  the  compensator 

0(Hp/+$,r,(-*tj7/+f,)  (1.10) 

stabilizes  the  feedback  system  for  any  wEH  such  that 

(up, +#,)€/;  and  every  stabilizing  compensator  is  given 
by  (1.10)  for  some  wEH.  Moreover,  when  this  compensa¬ 
tor  is  used  in  the  feedback  system  all  of  the  usual  gains  of 
the  closed-loop  system  are  linear  (actually  affine)  in  w. 
Specifically,  the  system's  input/output  gain  is  given  by 

A,„  *  ~Pr»Pt  +p,q,  (1-11) 

while  the  system’s  input/error  gain  takes  the  form 

A.,  “  1  +Pr»Pl  ~PAr-  (1*12) 

The  remaining  gains  are  also  linear  in  w  and  are  derived 
in  [6]. 

We  note  that  even  though  our  approach  to  the  feedback 
system  design  problem  is  formulated  in  a  ring,  no  “ring 
theory”  is  employed.  Indeed,  the  only  mathematics  re¬ 
quired  in  the  entire  paper  is  addition,  multiplication,  sub¬ 
traction,  and  inversion. 

Finally,  the  above  notation  exemplifies  a  pattern  that 
we  will  try  to  follow  throughout  the  paper,  namely,  an 
input- output  operator  win  be  denoted  by  a  single  upper¬ 
case  letter  (P\  its  coprime  factorizations  will  be  denoted 
by  the  corresponding  lowercase  letter,  with  a  tilde  over  the 
denominator,  and  subscript  /  or  r  for  left  or  right  (e.g, 
prp~l),  and  last,  the  elements  appearing  in  the  coprime- 
ness  equation  will  be  the  next  letter  in  the  alphabet,  with 
tilde  and  subscripts  matching  the  elements  which  they 
multiply  (e.g^  q,pr  +p,qr  -  !)• 

II.  Doubly  Coprime  Fractional 
Representations 

In  the  stabilization  theory  discussed  in  the  previous 
section  it  was  assumed  that  the  plant  admitted  both  left 
and  right  coprime  fractional  representations 

P-P,P,~'  -Pf'Pt  (2-1) 

for  which  there  exist  qr  EH  and  Q,  E  H  such  that 

1,P,+i,P'm  1  (2.2) 

and  q,  EH  and  q,  EH  such  that 

lOer  concept  of  stability  ben  rtqulm  that  *Q  internal  system  pun 
an  stable  (Le,  in  H)  in  addition  to  the  fains  observed  between  the 
actual  inputs  and  outputs.  These  concepts  an  discussed  in  non  detail  in 

ra. 


Pi4i+Pi4i- 1-  (2-3) 

Although  this  structure  sufficed  for  the  stabilization  the¬ 
ory  of  problem  S,  one  can,  in  fact,  adopt  a  stronger 
structure  without  loss  of  generality  [3]. 

Property  1:  Assume  that  P  admits  both  left  and  right 
coprime  fractional  representations  such  that  (2.1)  through 
(23)  are  satisfied.  Then  there  exist  q]  and  q]  in  H  such 
that 


PAi  +PA!  -  1 


(2.4) 


and 


M -94!' 


(23) 


Proof:  Recall  that  the  inverse  of  a  two  by  two  matrix 
with  elements  in  a  noncommutative  ring  is  given  by 


X 

Z 


Y 

W\ 


-i 


A-' 

-WK-'ZA-1 


-A-'np-' 
fV-'  +  W-'ZA~'YlV- 
(2.6) 


where  A-jf- YW~lZ,  provided  the  indicated  inverses 
exist  Applying  this  to  the  matrix 


4, 

[  -Pi  Pi 


we  find 


A-‘-pr, 


and,  after  some  computation. 


where 


9r 

-i 

Pr 

-4} 

.  -Pi 

Pi 

.Pr 

r, 

and 


<t!  -PA At  +4 i  -PA At 


H  -it  -PA At  +PAAi- 


(2.7) 


(2.8) 


(2-9) 


(2.10) 


(2.11) 


Equations  (2.4)  and  (23)  now  follow  immediately.  ■ 
In  essence,  Property  I  implies  that  if  a  plant  admits  any 
pair  of  left  and  right  coprime  fractional  representations 
then  it  also  admits  a  stronger  fractional  representation  in 
which  the  q’i  intertwine  in  a  manner  similar  to  the  p’s. 
We  say  that  such  a  fractional  representation  is  doubly 
coprime  and  (dropping  the  primes)  we  denote  it  by 


(2.12) 


Since  a  doubly  coprime  fractional  representation  exists 
whenever  separate  left  and  right  coprime  fractional  repre¬ 
sentations  exist  we  may  work  with  a  doubly  coprime 
fractional  representation  without  loss  of  generality.  This 
in  turn,  means  that  rather  than  the  three  equalities  (2.1) 


4r 

4r 

-1 

m 

Pr 

~4, 

.  -Pi 

Pi. 

.Pr 

l. 
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through  (2.3)  we  have  eight  equalities  with  which  to 
manipulate  the  feedback  system  equations.  These  are  ob¬ 
tained  by  both  premultiplying  and  postmultiplying  the 
matrix  of  (2.12)  by  its  inverse  to  compute  the  identity  and 
take  the  form 


PtPr  mPlPr 

(2.13) 

4Alm<lAl 

(2.14) 

Mr+IrP,-  1 

(2.15) 

PA,  +Pi4i  "  l 

(2.16) 

Pa,  •hPt 

(2.17) 

pA,  m4tPt 

(2.18) 

PAr+hPtm  1 

(2.19) 

PA r  +ilPt  -  1- 

(2.20) 

Finally,  we  note  that,  if  in  the  stabilization  theory  in  [6] 
one  begins  with  the  above  doubly  coprime  representation 
for  the  plant,  the  representation  (1.10)  for  the  controller  is 
also  doubly  coprime:  a  simple  calculation  gives 

P,  P,  ~'f  P,»+$,  ~J », 

l -/,**+*/  P,\ 

III.  Analysis 

We  say  that  the  feedback  system  of  Fig.  1  tracks  T  if 
ktfT&H.  This  definition  may  be  justified  by  considering 
the  case  where  if  is  the  ring  of  exponentially  stable 
rational  functions.  Here,  if  one  lets  ^  be  the  impiilm 
response  of  T,  p (s)—T(s)  and 

p(s)-r(s)-*(s)»h,lt(s)p(s)-h"t(s)T(s).  (3.1) 

Therefore,  w  asymptotically  tracks  p  if  and  only  if 
ht„(r) T\s)  is  exponentially  stable,  i.e.,  htpT&H.  Note  that 
the  impluse  response  of  T  may  be  unbounded  (since  T 
may  be  unstable)  even  though  the  input  to  the  reference 
generator  is  bounded.  Of  course,  the  same  intuition  ap¬ 
plies  in  a  more  general  setting  where  the  condition 
H  implies  that  v  is  asymptotic  to  p  in  whatever  sense  the 
response  of  the  systems  represented  by  elements  of  H  is 
asymptotic  to  zero. 

Similarly,  we  say  that  the  feedback  system  of  Fig.  1 
rejects  R  if  h,nRGH.  Once  again  if  H  is  the  ring  of 
exponentially  stable  rational  functions  and  p  is  the  im¬ 
pulse  response  of  R,  p(s)-R(s)  and 

^U)mh^(s)p(s)»hrp(s)R(s),  P-2) 

which  is  asymptotic  to  zero  if  and  only  if  h^(s)R(s)  is 
exponentially  stable,  i.e.,  hrfReH.  Thus,  the  system  re¬ 
jects  R  in  the  sense  that  its  response  to  the  impulse 


Fl*.  2.  Stable  feedback  system. 


response  of  fl  is  asymptotic  to  zero.  See  £1),  [3J,  [4],  and  (7) 
for  a  fuller  discussion  of  these  concepts  of  tracking  and 
rejection. 

Now,  insider  the  feedback  system  of  Fig.  2.  Adopting 
an  argument  similar  to  that  employed  by  Francis  £7]  in  a 
frequency  domain  setting,  we  obtain  the  following  the¬ 
orem,  which  characterizes  the  stable  feedback  systems 
which  track  T  and/or  reject  if.  For  this  purpose  we 
assume  that  P,  T,  and  R  each  admit  left  coprime  frac¬ 
tional  representations 


p~pr'p, 

(3J) 

T-t'r't, 

(3.4) 

and 

R-f^'r, 

(3.3) 

where 

PA/  +PAt m  1 

(3.6) 

(3.7) 

and 

r,s,  +r,s,  - 1. 

(38) 

Theorem  1 :  Assume  that  the  feedback  system  of  Fig.  2 
is  stable.  Then,  the  system 

1)  tracks  T  if  and  only  if  f,  is  s  right  divisor  of  p„  in  the 
sense  that  p,if 1  €/f; 

2)  rejects  R  if  and  only  if  f,  is  a  right  divisor  of  p„  in  the 
same  sense. 

Proof:  We  have 


(3.9) 

(3.10) 

and  so  h,^i,~'t,  GHemhvt,~l  SH.  Now 

*vm( Pr+P/)~'Pr 

(3.11) 

and  since  the  system  is  stable,  we  know  (from  [6])  that 
P,  +p,  €/. 

Therefore, 

hj,-'  e  h~(p,  +p,ylplfrt  en 

**Pii,~ 1 Stf, 

(3.12) 

and  part  1)  follows.  Pan  2)  is  proved  similarly. 

■ 
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IV.  Thb  Design  Equations 

We  now  turn  our  attention  to  the  system  design  prob¬ 
lems  ST,  SR,  and  STR.  Since  in  each  of  these  problems 
the  compensator  must  stabilize  the  system  in  addition  to 
causing  it  to  track  T  and/or  reject  R,  the  theory  of  [6] 
implies  that  our  compensator  must  take  the  form  of  (1.10) 
with  weH  chosen  to  assure  that  ht^TeH  and/or  h,nR  £ 
H.  For  this  purpose  we  assume  that  the  plant  P  admits  a 
doubly  coprime  fractional  representation  (112)  while  T 
and  R  are  assumed  to  admit  left  coprime  fractional  repre¬ 
sentations  as  in  (3.4),  (3.5),  (3.7),  and  (3.8). 

Theorem  2:  For  die  feedback  system  of  Fig.  1: 

1)  problem  ST  is  soluble  if  and  only  if  the  equation 

prwp,  +xi,  -prq,  - 1  (4.1) 

admits  solutions  w  and  x  in  H; 

2)  problem  SR  is  soluble  if  and  only  if  the  equation 

Pr”P,  +y?t  -P,q,  (42) 

admits  solutions  w  and  y  in  H; 

3)  problem  STR  is  soluble  if  and  only  if  (4.1)  and  (4.2) 
admit  simultaneous  solutions  w,  x,  an dy  in  H. 

Moreover,  when  a  solution  exists  the  set  of  compensa¬ 
tors  which  solve  the  problem  is  given  by 

C-(Hpl+q,)”,(~N5p,+4,)  (4.3) 

where  w  is  any  solution  of  the  appropriate  equation(s), 
such  that 

wp,  +$,  ei. 

Proof:  Since  every  stabilizing  compensator  is  given 
by  (1.10)  for  an  arbitrary  w  eH  with  the  resultant  hu 
given  by  (1.12)  the  w's  that  are  both  in  H  and  simulta¬ 
neously  place 

h.Jm[  1+FM-M,]*/"1*/  (4.4) 

in  H  define  the  class  of  compensators  which  solve  prob¬ 
lem  ST.  In  the  course  of  proving  Theorem  1,  we  proved 
that  htpTeH  if  and  only  if  eH.  Now  suppose  that 

h,Sr'm-xeH.  (4.5) 

Then, 

l  +P,»Pt  - PA ,  -  -  xi„  (4.6) 

which  is  equivalent  to  (4.1). 

Conversely,  if  (4.1)  admits  solutions  w  and  x  in  H,  then 
using  this  w  in  (1.10) ^yields  a  compensator  which  stabilizes 
the  system  with  1  ■  —xeH;  that  is,  r  compensator 
which  solves  the  problem  ST. 

The  proof  of  2)  is  obtained  by  a  parallel  argument  with 
h,0  replaced  by  and  T  replaced  by  R  while  3)  is 
obtained  by  recognizing  that  the  solution  of  problem  STR 
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requires  a  simultaneous  solution  of  problems  ST  and  SR 
with  the  same  compensator,  hence  the  same  w.  ■ 

Following  14,  we  define  pt  and  t,  to  be  skew  coprime  if 
there  exists  j  and/  in  H  such  that 

Pj+Ji,  - 1- 

We  then  can  state  the  following  corollary.  (Compare  [2], 
[41,  [13D- 

Corollary  1:  For  the  feedback  system  of  Fig.  I, 

1)  a  necessary  condition  for  problem  ST  to  admit  a 
solution  is  that  p,  and  t,  are  skew  coprime; 

2)  a  necessary  condition  for  problem  SR  to  admit  a 
solution  is  that  p,  and  r,  are  right  coprime; 

3)  a  necessary  condition  for  problem  STR  to  admit  a 
solution  is  that  pr  and  t,  are  skew  coprime,  p,  and  r,  are 
right  coprime,  and  e,  and  r,  are  right  coprime. 

Proof:  If  problem  ST  admits  a  solution,  (4.1)  is  satis- 
fled  for  some  w  and  x  in  H,  hence,  upon  rearranging  the 
terms  in  this  equation,  we  have 

Pr[  < l ,  -*?>/]  +  [-*]'’/  “Pj+jt  - 1  (4.7) 

where  j—q,  -wp,  and  /■*  —x.  To  verify  2)  we  substitute 
(220)  into  (42)  to  obtain 

PWi  +yf,  »PA,  - 1  ~  i,P,  (4.8) 

and  rearrange  terms  to  obtain 

+  (4.9) 

as  required.  Finally,  if  problem  STR  admits  a  solution, 

(4.1)  and  (42)  are  satisfied  with  the  same  w,  hence,  so  are 
(4.7)  and  (4.9).  To  obtain  the  final  coprimeness  condition 
we  subtract  (4.1)  from  (42),  obtaining 

M'/ +  [-•*]'/ -1  (4.10) 

and  thereby  completing  the  proof.  g 

Note  that  the  coprimeness  condition  of  (4.10)  essen¬ 

tially  says  that  one  cannot  simultaneously  track  and  reject 
the  same  signal,  and  is  therefore  a  natural  auxiliary  condi¬ 
tion  to  guarantee  the  simultaneous  solvability  of  the  track¬ 
ing  and  disturbance  rejection  problems. 

Note  also  that  (4.1)  is  linear  in  the  unknowns.  It  follows 
that  if  this  equation  is  solvable  the  solution  space  will  be  a 
linear  manifold  in  H  which  can  be  represented  in  the  form 
w~Lo+d  where  Z-  is  an  appropriate  linear  operator  on  H 
and  o eH  becomes  our  new  design  parameter.  The  com¬ 
pensators  which  satisfy  the  constraints  of  problem  ST 
thus  take  the  form 

C-[(Lo+d)p,+^]“'[-(Lo+d)^+^]  (4.11) 

and  similarly  for  problems  SR  and  STR.  Moreover,  upon 
substituting  the  expression  tv«£o+dinto  (1.11)  and (1.12), 
we  obtain  expressions  for  the  resultant  feedback  system 
gains  that  are  linear  in  the  new  design  parameter  o.  Thus, 
all  of  the  properties  associated  with  the  solution  of  prob¬ 
lem  S,  given  in  [6],  are  retained  by  the  solutions  of 
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problems  ST,  SR,  and  STR  of  the  present  paper.  It 
remains,  however,  to  derive  explicit  necessary  and  suffi¬ 
cient  conditions  for  the  existence  of  solutions  to  these 
problems  and  to  formulate  explicit  expressions  for  the 
parameterization  of  the  resultant  solution  space. 

Finally,  we  note  that  one  normally  desires  a  proper 
controller  C.  The  obvious  approach  is  to  try  to  find  the  set 

{w€/f|Hp, +$,€/}; 

if  one  then  takes  G  to  be  a  ring  of  proper  transfer 
operators,  one  has  obtained  all  proper  stabilizing 
controllers.  Unfortunately,  in  the  setting  of  general  rings, 
the  above  set  may  be  empty.  Therefore,  rather  than  im¬ 
pose  the  additional  structure  necessary  for  a  general  the¬ 
ory,  we  indicate  what  can  be  done  in  the  classical  multi- 
variable  case.  The  additional  condition  needed  in  this  case 
is  quite  weak;  we  merely  require  that  for  any  constant 
matrix  A,  there  is  a  wSH  such  that 

w(oo 

If  we  now  take  G  to  be  the  ring  of  proper  transfer 
matrices,  it  is  easy  to  see  that  the  above  set  is  nonempty, 
as  follows. 

Since pr  €l,p~\co)  exists.  Then,  from  (2.19), 

(  P7  '(«)9/(<»))f»/(«) +&(«) mPr~  '(«)- 


transfer  functions  which  are  stable  in  an  appropriate 
sense.  More  generally,  multidimensional  systems  are  in¬ 
cluded  in  our  theory  with  K  taken  to  be  C"  and  L  an 
appropriate  polydisk. 

Using  the  above  function  spaces  we  formulate  an 
axiomatic  structure  { G,H,I,J }  by  letting  G-Af"*"  be 
the  set  of  a  by  a  matrices  whose  entries  are  elements  of  K 
and  H—LmXm  be  the  set  of  a  by  a  matrices  whose  entries 
are  elements  of  L.  Of  course,  /  and  J  may  be  constructed 
from  G  and  H  in  a  natural  manner.  Finally,  we  may 
embed  L  into  H—L"x*  by  identifying  /€L  with  the  a  by 
a  matrix  /I.  Under  this  embedding  each  /€L  commutes 
with  every  g€G«Ar"x"  and  A6/f-Z."x\ 

Although  we  will  normally  work  with  general  fractional 
representations  in  ((7,  H,  I,  J),  in  order  to  parameterize 
our  solution  spaces  we  need  to  work  with  a  fractional 
representation  Af—mm-1  where  meZ,"x*  and 

mSL.  Such  fractional  representations  naturally  model  the 
situation  where  the  denominator  of  a  fractional  represen¬ 
tation  is  just  the  common  denominator  of  each  entry  in  an 
n  by  n  matrix.  Since  L  commutes  with  H,  m  and  m  define 
both  left  and  right  fractional  representations  for  M.  More¬ 
over,  a  standard  function  space  argument  using  the  usual 
rank  coprimeness  for  matrices  will  reveal  that 
is  right  coprime  if,  and  only  if,  is  left  coprime. 

Therefore,  we  may  say  that  m  and  m  are  coprime  without 
a  qualifier  and  assume  a  doubly  coprime  representation 


Now,  if  we  let 


*r 

"r 

-i 

m 

-”t 

—  m 

m 

m 

*1, 

(5.1) 


it  follows  that,  for  all  *€//„,  wp,+qrel.  In  fact,  it  is 
easy  to  show  that  w e/f*  parameterizes  all  strictly  proper 
controllers.  In  general,  one  can  show  that  it  is  possible  to 
find  a  controller  with  C(oo)«A  if  and  only  if  (p,(oo)A  + 
/;(«)) -l  exists,  and  that  the  set  of  all  such  stabilizing 
controllers  is  parameterized  by  the  set  of  all  wS/f,,  where 

{  w€/f  |  w(  oo)  «  (qr(  oo)—$,(  ao  )ri  )(/,(«) +p,(oo)i4)}. 

Similar  considerations  apply  to  the  parameterizations 
occurring  in  subsequent  sections  of  the  paper. 

V.  The  Muluvajuatb  Case 

In  this  section  we  consider  the  solution  of  the  feedback 
system  design  equations,  (4.1)  and  (4.2),  for  a  class  of 
generalized  multivariate  systems  which  includes  most  time- 
invariant  feedback  systems  encountered  in  engineering 
practice.  To  this  end  we  let  K  denote  a  commutative  ring 
of  complex  valued  functions  defined  on  a  complex  mani¬ 
fold  K  which  includes  the  constant  function,  1,  and  we  let 
L  denote  the  subring  of  K  composed  of  functions  which 
are  analytic  on  a  submanifold  L  Normally,  it  is  the 
complex  plane  and  £  is  a  half-plane  or  disk  with  K 
representing  any  of  the  standard  spaces  of  single-variate 
transfer  functions  and  L  representing  the  subspace  of 


without  loss  of  generality.  Note,  since  H~LnXm  is  non- 
commutative  we  may  have  nr+n,  and/or 
In  this  generalized  multivariate  setting  we  would  like  to 
derive  explicit  solutions  of  the  feedback  system  design 
problems  ST,  SR,  and  STR.  To  this  end  we  assume  that 
the  plant  P  admits  a  doubly  coprime  fractional  represen¬ 
tation  in  our  multivariate  setting  while  the  reference  gen¬ 
erators  T  and  R  are  assumed  to  admit  doubly  coprime 
fractional  representations 

-t  i 
and 

K  ir 

—r  r 

where  t  and  f  lie  in  L. 

This  assumption  represents  a  restriction  on  our  system 
to  the  effect  that  the  coprime  denominator  of  the  transfer 
matrix  T  is  a  common  denominator  for  the  elements  of  T, 
and  similarly  for  R  [see  (3)].  Since  the  signals  to  be 
tracked  or  rejected  are  by  definition  the  response  of  these 
opera  ton  to  a  delta-function,  our  situation  is  dearly  anal¬ 
ogous  to  that  in  which  the  signals  are  generated  as  the 
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t  -u, 


(5-2) 
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zero-input  response  of  a  set  of  ordinary  differential  equa¬ 
tions. 

A.  Tracking 

With  the  above  restriction  on  the  matrix  T,  we  now 
proceed  to  parameterize  the  set  of  ail  controllers  which 
solve  the  problem  ST,  i.e„  those  which  simultaneously 
stabilize  the  system  and  cause  it  to  track  the  impulse 
response  of  T.  As  in  (6)  our  approach  will  be  to  find  a 
particular  solution  to  (4.1),  and  then  to  And  the  general 
solution  of  the  corresponding  homogeneous  equation. 
Lastly,  we  show  that  these  are  the  only  solutions.  In  all 
three  steps,  extensive  use  is  made  of  the  various  coprime- 
ness  conditions,  and  of  the  fact  that  t  commutes  with 
everything. 

First  we  observe  that  the  tracking  problem  admits  a 
solution  if  and  only  if  p,  and  i  are  coprime  [see  (2),  (4),  (13)]. 
The  necessity  of  this  condition  follows  from_  the  skew 
coprimeness  condition  of  Corollary  1-1)  since  tSL.  Con¬ 
versely,  if  p,  and  t  are  coprime,  then  by  the  dismission 
preceding  equation  (5.1),  there  exist  j„  j„  j„  and  j,mH 
such  that 


and  xk  -  -proe,  (5.11) 

are  solutions  to  the  homogeneous  equation  corresponding 
to  (4.1). 

To  verify  that  (5.11)  represents  all  homogeneous  solu¬ 
tions  to  (4.1)  we  consider  arbitrary  homogeneous  solutions 
w'k  and  xj  in  H  satisfying 


PrKP,  +**'*“0- 

(5.12) 

Now,  let  o'-w*e/*\  which  yields 

w'„  -o'e, 

(5.13) 

and 

K  ”  -ptKP,i~l-  -py^r'e,  -  -Pfi'e, 

(5.14) 

verifying  that  w'h  and  x'h  are  of  the  required  form.  It 
remains  to  show  that  v'SH.  To  this  end  consider  the 
string  of  equalities  in  which  the  coprimeness  conditions  of 
(5.4)  and  (5.7)  are  invoked  along  with  the  commudvity  of 

i. 


+'///) 


I  Jr 

.  -P'  ‘ 

Now,  by  invoking  (2.20)  and  the  fact  that  i  commutes 
with  H  we  have 


mW'Jl  +  {j,P,  +j,i)»k?r'*lfl 
m"hfl  -jrX'kft  +j,r»’jr  '•ill 

-  </i  -Jr*kfi  +Jyjr  W/i 
m»'J,  -jrx'jt  +Jy»r%hJ, 


pa,  - 1  -  -Ipi  -  ~(p,Jt  +‘I)i/Pt 
-P,[  ~Mt\  Pi  +  [  ~M,p]  i 
-P'[W,]P> +  {*,}< 


(5.15) 

It  follows  that  (5.1 1)  represents  the  set  of  all  homogeneous 
solutions  to  (4.1)  and  the  desired  parameterization  of  the 
solution  space  is  given  by 


showing  that 

"V  ■  ~j,4i  and  -  -j,4,P,  (5.6) 

are  elements  of  H  which  satisfy  (4.1). 

To  parameterize  the  solution  space  of  (4.1)  we  assume 
that  E**p,i~x  admits  a  doubly  coprime  fractional  repre¬ 
sentation 


/, 


l, 


(5.7) 


w *•->/?/+«*/  and  x-  -M,p,  -p,oe,.  (5.16) 

Summarizing  the  above  development  we  have  the  follow¬ 
ing  theorem. 

Theorem  3:  For  the  multivariate  feedback  system  of 
Fig.  1  let  P,  T,  and  E  be  characterized  by  the  doubly 
coprime  fractional  representations  of  (2. 12),  (5.2%  and 
(5.7%  with  t  €  L.  Then  problem  ST  admits  a  solution  if,  and 
only  if,  p,  and  t  are  coprime,  in  which  case  the  set  of 
compensators  that  satisfy  the  constraints  of  problem  ST  is 
given  by 


Then 

P,i-'-ir'e, 

(5.8) 

and  so 

iiP,mt,t. 

(5.9) 

Therefore,  for  any 

P,[o*i]p, 

+  ( -ppxi]imPP*ii-ppx,imQ 

(5.10) 

verifying  that 

C-[( -j,4,  +  v*,)p, + 4,}  ~ ‘  [  -  (  -Ml  +  °*i)P,  +  <!,} 

(5.17) 

where  j,  and  e,  an  defined  by  (5.4)  and  (5.7%  respectively, 
and  o  is  an  .arbitrary  element  of  H  such  that  the  de¬ 
nominator  of  C  is  in  I.  Moreover,  the  feedback  system 
gains  resulting  from  the  use  of  such  a  compensator  are 
linear  (affine)  in  the  design  parameter  v.  ■ 

Finally,  since  i,  €/,  considerations  similar  to  those  at 
the  end  of  Section  IV  show  that  the  set  of  proper  con¬ 
trollers  is  nonempty. 
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B.  Disturbance  Rejection 


The  derivation  of  existence  conditions  for  the  solution 
of  problem  SR  and  the  parameterization  of  the  resultant 
solution  space  parallels  the  above  derived  solution  to 
problem  ST.  For  this  reason  only  a  partial  sketch  will  be 
given  here.  First,  we  observe  that  the  disturbance  rejection 
problem  admits  a  solution  if,  and  only  if,  p,  and  r  are 
coprime.  The  necessity  follows  from  condition  2)  of 
Corollary  1.  Conversely,  if  p,  and  r  are  coprime  there  exist 
m„  mr,  m„  and  m,  in  H  such  that 


m. 

-1 

r 

-m, 

.  ~Pt 

P 

.Pr 

m, 

Thus, 

PA,  mrPl) 

mP,[  4,mr]  Pt  +  [  WA]  f 

verifying  that 

yr -PAS', 


(5(18) 


(5.19) 

(5*20) 


satisfy  (4.2). 

Next  we  desire  to  parameterize  the  solution  space  of 
(4.2)  when  p,  and  P  are  coprixne.  To  this  end  we  assume 
that  A»P~'p,  admits  a  doubly  coprixne  fractional  repre¬ 
sentation 


i  br 

-I 

ar  -b. 

-a,  a, 

a,  6, 

Now,  since 

p~lpr-a,a;' 


{5-21) 


(5.22) 


by  an  argument  similar  to  that  employed  in  the  solution 
of  the  tracking  problem  we  find  that  the  homogeneous 
solutions  to  (4.2)  are  given  by 

wk *od  ?k  “  -<*r*Pi  (5-23) 

and  hence  the  desired  parameterization  of  the  solution 
space  for  42  takes  the  form 

w-q,mr+arz  (524) 


where  m,  and  a,  are  defined  by  (5.18)  and  (5.21),  respec¬ 
tively,  and  z  is  an  arbitrary  element  of  H  such  that  the 
denominator  of  C  is  in  /.  Moreover,  the  feedback  system 
gnin«  resulting  from  the  use  of  such  a  compensator  are 
linear  (affine)  in  the  design  parameter  z.  ■ 

Again,  since  <5, 6/  the  considerations  at  the  end  of 
Section  IV  are  applicable. 

C  Simultaneous  Tracking  and  Disturbance  Rejection 

To  obtain  a  solution  of  problem  STR  in  our  multi¬ 
variate  setting  we  must  find  4  simultaneous  solution  to 
(4.1)  and  (4.2),  using  the  same  w,  hence,  the  same  com¬ 
pensator.  Since  we  already  have  a  complete  parameteriza¬ 
tion  of  the  solution  spaces  for  these  equations  taken 
individually,  the  solution  of  the  simultaneous  problem 
reduces  to  finding  values  for  the  arbitrary  parameters,  v 
and  z,  which  yield  the  same  w.  In  short,  upon  combining 
(5. Id)  and  (5 25),  the  solution  to  problem  STR  may  be 
obtained  by  solving  the  linear  equation 

qrmr  +ipz  -  -j,4,  +  oe,  (5-27) 

for  t  and  o  in  H.  From  corollary  1-3)  we  have  the 
necessary  condition  that  p,  and  f,  p,  and  r,  and  t  and  P  all 
be  coprime.  Moreover,  since  i  and  f  are  both  in  L,  the 
coprimeness  condition  for  t  and  r  may  be  formulated  in 
terms  of  elements  t  and  Pin  L  as  follows. 

Lemma  1:  Assume  that  t  and  P  are  right  coprime  in  the 
sense  that  there  oust  t,  and  Pr  in  H  such  that  tri+PJ~  1. 
Then  there  exist  i  and  r  in  L  such  that 

ff+fr-1.  (528) 

Proof:  Recalling  that  r*  and  fr  are  matrices,  we  let  t 
and  r  be  the  (1, 1)  entries  in  /‘and  Pr,  respectively.  ■ 

Recall  that  e,~  le,  is  a  left  coprime  fractional  representa¬ 
tion  of  p,i~l  —i~lpt;  it  follows  that  t»k,e,  for  some 
k,  6/f.  Similarly,  F~arnr  for  some  «,  6 H.  We  use  this  to 
verify  the  sufficiency  of  the  three  coprimeness  conditions 
as  follows. 

We  rewrite  (5.27)  in  the  form 

oi,-dr2^qrmr+jlpl.  (529) 

Now,  starting  with  (528)  and  invoking  the  fact  that  L 
commutes  with  G  we  obtain 


and 

ymPA'™r~a'zP  (5-25) 

where  z  is  an  arbitrary  element  of  H. 

Theorem  4:  For  the  multivariate  feedback  system  of 
Fig.  1  let  P,  R,  and  A  be  characterized  by  the  doubly 
coprime  fractional  representations  of  (2.12),  (52),  and 
(521)  with  PSL.  Then,  problem  SR  admits  a  solution  if, 
and  only  if,  p,  and  P  are  coprime,  in  which  case  the  set  of 
compensators  which  satisfy  the  constraints  of  problem  SR 
is  given  by 

C-  [( qfm ,  +  d,z )p,  +  $,] " '  [  -  ( qrm,  +  irt)p,  +  ?,] 

(526) 


[  tj,mr  +;)<?/]  -  [  ?,«,  +j,4,]  it+r?[  qrmr  +jrf,] 

-  [  +Mt]  *kt*t  d,mr  +j,4,] 

-o,i,-arzf  (520) 

where 

v,  “  [  +JA,]  tki  and  -  nrP[  qrmr  +j,4,] 

(521) 

verifying  that  our  simultaneous  equations  admit  a  solu¬ 
tion. 

To  parameterize  the  resultant  solution  space,  we  ob¬ 
serve  that 
1ft 
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[i,d]i,  -«,[<£,]  -0  (5-32) 

implying  that 

ok-S,d  and  zk-di,  (533) 

are  homogeneous  solutions  to  (537)  for  all  dEH.  To 
verify  that  (533)  represents  all  homogeneous  solutions  to 
(527)  we  consider  an  arbitrary  pair  of  solutions  v'k  and  z'k 
in  if  such  that 

(534) 

We  now  let  d'-d"  V*.  in  which  case 

®i  -d,d‘  (535) 

and 

z'k  -  a,"  '»**,  -d'e„  (5  36) 

verifying  that  v’k  and  z’k  are  the  required  form.  It  remains 
only  to  show  that  d’ EH  lot  which  purpose  we  invoke  the 
various  coprim en ess  conditions  and  the  commutativity  of 
L  with  <7  as  follows. 

d'-d’V*  •(bra,+bj,)a-iv i 

+b,a,arlo'k(<lfl  +i,f,)t7 

-Vi  +b,a,a-larnji? 

+ b,a,a~  'v'jJtit + b,a,a- 'v'kt,if,i 

-Vi 

+b,a,a-lo'kelp,flt 

-Vi  +  brarnfi'ki+btarz'k3,ti 

+brarz’hp,f,ieH.  (537) 

Consistent  with  the  above,  the  required  parameterization 
for  the  solution  space  of  (539)  is  given  by 

and 

r--n,r(<7,m,  +/,$,]  +de,  (538) 

which  upon  substitution  into  (5.1$)  or  (535)  yields 

w-qrmr  +d,{  -n,r[  q,mr  +j,4, ]  +de,} 

- q,m,-rr[  q,m,  +j,ql]  +ardi, 

•tt(q,mr)-rf{M,)+afda,.  (539) 

We  have  thus  proven  the  following. 

Theorem  3:  For  the  multivariate  feedback  system  of 
Fig.  1  let  P,  T,  R,  E,  and  A  be  characterized  by  the 
doubly  coprime  fractional  representations  of  (2.12),  (52% 
(53),  (5.7),  and  (521)  with  F  and  t  in  L.  Then,  problem 
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STR  admits  a  solution  if  and  only  if  p,  and  t,  p,  and  F,  and 
I  and  F  are  coprime,  in  which  case  the  set  of  compensators 
which  satisfy  the  constraints  of  problem  STR  is  given  by 

O  {  [  ~  "(Mi) +*r<&t]  Pi  +  $,}  ~ ' 

•  {  “  [  «(*,«,)  -  "(Mi) + <*,<#/]  p,+qr)  (5.40) 

where  j„  i„  mr,  a„  f,  and  F  are  defined  by  (5.4%  (5.7), 
(5.18%  (521%  and  (528),  respectively,  and  d  is  an  arbitrary 
element  of  H  such  that  the  denoniinator  of  C  is  in  /. 
Moreover,  the  feedback  system  gams  resulting  from  the 
use  of  such  a  compensator  are  linear  (affine)  in  the  design 
parameter  d.  ■ 

Again,  the  considerations  at  the  end  of  Section  IV  are 
applicable. 

Although  we  have  gone  through  some  rather  complex 
derivations  in  the  preceding  it  should  be  noted  that  the 
only  mathematics  employed  are  addition,  multiplication, 
subtraction,  and  inversion.  Moreover,  the  results  of  these 
derivations  are  given  in  the  form  of  explicit  expressions 
for  the  compensators  that  satisfy  the  constraints  of  the 
three  design  problems. 

Although  the  expression  (5.40)  in  particular  looks  ex¬ 
tremely  complicated,  most  of  the  terms  occurring  in  it  are 
fixed  transfer  matrices;  it  can  be  rewritten  in  the  form 

C-  [(g0  +arde,)p,+qr]-l[  -(g0  +arde,)p,  +*,] 

or  even  in  the  form 

C-(g,^  +g,r,(g4‘frs  +*«) 
where  all  of  the  g,  are  fixed  transfer  matrices. 

VI.  Examples 

To  minimize  repetition,  our  first  two  examples  are  con¬ 
tinuations  of  examples  begun  in  [6],  where  the  coprime 
fractional  representations  employed  below  are  computed. 

A.  A  Single-Variate  Tracking  and 
Disturbance  Rejection  Problem 

Although  the  single-variate  case  is  already  well  under¬ 
stood,  the  theory  is  most  readily  illustrated  in  this  case, 
and  hence  we  begin  with  a  single  variate  example.  Here, 
the  plant  is  taken  to  be 


(4+») 

(s+1) 

(s-2) 

(r1  —4) 

(s+2)2 

(4+2) 

-p(s)p(s)~l  (6.1) 


where 


(j+0 

.  (i+2)J 


(j+2/3)  |f  (r— 2) 

(r+2)  J[(r+2) 
-f(»)fW+^)^)-l.  (62) 
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Of  course,  since  the  single-variate  problem  is  commutative 
these  g's  and  ^’s  yield  a  doubly  coprime  fractional  repre¬ 
sentation  in  {G,  H,  I,  J).  Here  G  is  taken  to  be  the  ring  of 
proper  rational  functions  and  H  is  the  ring  of  rational 
functions  whose  poles  have  a  real  part  which  is  less  than 
- 1.  Thus,  the  theory  will  yield  “strongly  stable”  systems 
in  the  sense  that  their  poles  will  be  bounded  away  from 
the  imaginary  axis.  Moreover,  since  the  theory  uses  the 
same  stability  concept  for  the  tracking  and  disturbance 
rejection  problems  as  for  stabilization,  the  resultant  solu¬ 
tions  to  these  problems  will  be  “strongly  asymptotic”  in 
the  same  sense. 

Now,  let  us  consider  the  problem  of  tracking  a  step 
function.  Here  we  let 


>+2)  [(r+2) 


-‘(s)i(sy 


4(r+l) 

.  (*+2) 


][  (r+2)  +  (r+2)  ][  (s+ 2) 


Moreover, 


mf  ('+1)  lif  *  If  1 

1  J  (s+ If  (•»+2)  j[  (s+2) 


W  [(yrI  +  16s+16)-(,-2)so(s)] 


h„(s)~  — 2-j  [r(r— 2)(r+2/3)+(s+lX*~2)o(r)] 
(r+2) 


in  terms  of  the  design  parameter  o(s).  Note  the  zero  at 
zero  in  h,^(s)  assures  that  the  system  win  track  a  step 
function  as  required. 

Now,  let  us  consider  an  alternative  tracking  problem 
where  we  are  required  to  track  e2,U(t)-  That  is, 

r(x)_f_L_l-[_L_.]f 

[(r-2)j  [(r+2)J[(r+2)J 


-r(r)r(r)- 


where 


(r+2)  J[  (r+2)] 
-u(r)r(r)+u(r)i'(r)-l.  (6.4) 


->(■»)/>(')+/(•»)<*(•»)  - 1  (6-5) 


M 


16  l[  (r+1) 
3  •'[  (r+2)2 


'  (*— 2) 
(s+2) 


- *(s)t(J)+S(,)i(s )  (6.12) 


verifying  that  p(s)  and  r(r)  are  coprime  and,  hence, 
assuring  the  existence  of  a  solution  to  our  tracking  prob¬ 
lem.  To  compute  the  solution  we  define 

-•(s)#(s)-'  (6.6) 

where 

r-n[l£l2L]+[2]f— £— 

1  Jl(s+2)J  1  J[(r+2)j 

1.  (6.7) 

From  S.16  we  then  have 

w(r)-  -j(s)g(s)+v(s)e(s) 

Substituting  this  value  of  w (r)  into  (1.11)  and  (1.12)  we 
obtain  the  system  gains 


(r+2/3)  1[  (r—2) 
(r+2)  (r+2) 


-Ks)p(s)+Rs)*is)^  1.  (6.13) 


Thus,  p(r)  and  i(s)  are  coprime  and  we  may  proceed  to 
construct  the  desired  set  of  compensators.  For  this  pur¬ 
pose  we  define  a  system  E(s)  by 

-l-[l]El]",-^W‘  (6-14) 


-[o][i]+[i][i]-/(r)«(z)+/(r)*(r)-i. 


Note  that  in  this  example  p(s)  and  f(s)  are  not  coprime. 
This  is,  however,  not  necessary  as  long  as  we  can  con¬ 
struct  a  coprime  fractional  representation  for  their  ratio, 
E(s).  Finally,  substitution  into  (5.16)  yields 
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and 


*«(*)• 


(s-2) 


9(s+2)4 

•  [(9sJ  -6s2  -  20s -  8) + 9(s +  l)(s+2)o(s) ] .  (6.18) 

As  before,  the  (s-2)  factor  in  the  numerator  of  the  htfj(s) 
verifies  the  tracking  property.  Also  note  that  as  c(s)  spans 
the  set  of  “strongly  stable”  functions  so  does  w(s).  Thus, 
every  compensator  that  stabilizes  the  system  in  our  sense 
also  solves  the  tracking  problem.  Although  redundant,  the 
extra  term  in  (6.16)  complicates  our  expression  for  C(s), 
A,,(s)  and  htfl(s)  and  should  be  eliminated  if  possible.  In 
our  theory  this  would  be  achieved  by  choosing  more 
opportune  fractional  representations  for  the  various  func¬ 
tions  with  which  we  deal. 

Now,  let  us  consider  the  problem  of  rejecting  a  step 
function  for  which  we  let 


Ms) 


•ris)fisrl 


(s+2) 


where 


f  4(s+ 1)  ]f  1 

4  - 

s 

(s+2)  ][  (s+2) 

[  (s+2) 

(s+2) 

Here, 


[2] 


(*+2) 


+  [-l] 


(s+i) 

s 

(s+2)2 , 

(s+2) 

ma(s)d(s) 


-l 


and 


M 


'  (s+ 1) 

4. 

[  *  If  *  1 

(s+2)J 

(s+2)  J[  (s+2) 

•*b(s)a(s)  +  £(j)d(s)- 1  (6.2' 


obtaining 


M*)ml(s)m(s)+S(s)t(s) 


/,  (,).£ii±ilf»(j+2)-(i-2Mr)]  (6.25) 
(s+2)4  3  J 

and  21 


M*)-  -—^1  [(,+2)(3j2  —  8s-  12)  +  3s(s+  l)r(s)]. 

J(J +2/ 

(626) 

Here,  the  factor  of  j  in  the  numerator  of  hrft(s)  indicates 
that  the  feedback  system  will,  indeed,  reject  step  func¬ 
tions.  Moreover,  as  we  have  previously  indicated,  any 
stabilizing  controller  tracks  e2,t/(r)  as  is  indicated  by  the 
(s— 2)  factor  in  the  numerator  of  A<#(s).  Interestingly,  if 
we  apply  our  theory  to  solve  this  simultaneous  tracking 
and  rejection  problem  we  obtain  an  equivalent,  but  more 
complex,  parameterization  of  the  desired  feedback  sys¬ 
tems.  To  this  end  we  let 


[2] 


s 

+  r  - 1  ] 

(s-2) 

(s+2)  \ 

L  *  J 

(s+2) 

(6.19) 


»r(s)r(s)  +  r(s)f(s)'- 1  (6.27) 

and  obtain 

w(s)-r(s)r(s)q(s)«(s) 

-r(s)H.s)j(s)q(s)+a(s)d(s)e(s) 

1 


■■s(s)/'(s)+l(s)/!(s)«,«  l.  (620) 

'  (s-2) 

(s+ 2)  J 

"•m(s)r(s)+m(s)p(s)“  1  (621) 


(s+2)2 


[  “  t(,j + ljr+4) +,(,+2)</(,)| 

(628) 

y  (-■  -  r*  f  -«-2x»«mw 

(629) 


and 


M*)’ 


(s-D 
9(s+2)5  J 


showing  that p(s)  and  r(s)  are  coprime.  Next  we  compute 


[(9s4  +  12sJ  +32s2  -  112s- 144) 

+  9s(s+l)(s+2)d(s)].  (6  JO) 


(622) 


(624) 


Equations  (6 28)  through  (6J0)  represent  a  complete 
parameterization  of  the  simultaneous  solutions  to  the 
tracking  and  disturbance  rejection  problem  where  d(s)  is 
a  design  parameteer  which  may  be  chosen  to  “optimize” 
some  other  aspect  of  the  feedback  system  design.  For 
instance,  if  we  would  like  to  create  an  additional  zero  of 
hT^(s)  at  sm  1  we  let  da*  -36.74  and  obtain 

w  (,).-ilillL(1280s2  +  768s— 2048).  (6J1) 
'  27(s+2)5 

B.  A  Multivariate  Lumped- Distributed  Disturbance 
Rejection  Problem 

We  now  consider  a  lumped/distributed  multivariate 
plant 


P(s)- 


(s-1) 


(6J2) 
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which  is  included  in  our  axiomatic  theory  by  taking  G  to 
be  the  ring  of  2  by  2  matrices  whose  entries  are  Lm 
functions  on  the  imaginary  axis  and  H  to  be  the  subring 
of  2  by  2  matrices  whose  entries  are  functions  on  the 
imaginary  axis.  We  woulu  like  to  design  a  compensator 
for  this  plant  which  will  simultaneously  stabilize  the 
feedback  system  and  cause  it  to  reject  the  sinusoidal  input 
sin (t)U(t).  To  this  end  we  obtain  a  doubly  coprime  frac¬ 
tional  representation 

-i-i 


Of  course,  since  the  r*s  and  s’s  are  all  commutative  (6.35) 
defines  a  doubly  coprime  fractional  representation  for 
R(s). 

To  solve  the  disturbance  rejection  problem  we  must 
now  verify  that  p,  and  r  are  coprime,  for  which  we  have 


<7,  <7, 

-Pi  Pi 


-i 


1 

0 

1  0 

0 

0 

1 

1  0 
i.  —  - 

_  _2_  _ 

el" 

(s-1) 

;  i 

i 

0 

(r+1) 

(j+1) 

0 

1 

i 

i  o 

(r-1) 

(r+1) 

1 

(r+1)  J 

l 


0 

2 


(*+!)* 


1 


0 

(s+1)  i 


1 

0 

0  1 
(*-D  ! 

(r+i)  i 

0 

0 

0 

-2 

(s-l)2  i 

1 

2(r-t) 

(s+1) 

(j+1)1  i 

(r+1) 

0 

1  1 

0 

1 

(*+l)  i 

Pr  “?/ 
Pr  St 


0  0 

+  o 

(r+1)2 

0  (,+3) 

o  (,2  +  1) 

(r+1) 

(-+!)■  J 

-m(j)//(j)  +  m(j)f(i)- 1. 


(6-36) 


As  before,  these  matrices  are  all  commutative  and  thus 
define  a  doubly  coprime  fractional  representation  for  our 
problem.  We  then  define 


(633) 


M*)mPr(*)'(- 0"  ” 


e-'/'U+l)  (s— 1)* 

(**+l)  (r*+l) 

o  iilil 

(*2  +  D 


and  we  let 


*(')- 


(*2  +  l) 


e-W  (r-1)2 

+  0 

0 

(r+1)  (J+1)J 

0  1  1 

1 

(r+1)  j 

[  (•*+  o2 

(sJ  +  l) 


i-t 


-a,(r)d,(r) 


-I 


(637) 


(s+l)J 


<*+l)2 


(fill) 

(s+l)J 


-i-i 


('*  +  0 
(*+l)2 


mr(s)r(j) 


-1 


(634) 


where 


which  is  right  coprime.  Of  course,  we  can  also  formulate  a 
left  coprime  representation  for  A(s)  along  with  the  ap¬ 
propriate  b’s  required  to  construct  a  doubly  coprime 
fractional  representation.  For  the  present  purpose,  how¬ 
ever,  all  that  is  required  is  3,(s)  and  hence  we  will  not 
derive  the  remaining  a’s  and  b' s.  Substituting  qr,  m,  and 
a,  into  (535)  now  yields  the  required  set  of  w's  for  the 
solution  of  problem  SR  for  our  lumped-distributed  multi- 


’  0 

1  0 

(,+3)  Q 

V  +  D  0  ] 

(«+!)  “ 

(r+  l)2 

+ 

(*+!)  ° 

(r+1)2 

o 

o  J 

0  <‘+3> 

o  ('*  +  »> 

(r+ l) 

(r+1)2. 

(r+1)2 

-r(r)r(r)+r(r)f(r)-l. 


(635) 
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variate  system. 


0  0 

[<'2+l>  0 

+ 

(J+l? 

n  4 

o  (,2  +  1) 

IV  _ 

L  (z+if  J 

(z+l)J 

(6.38) 

♦ 

where  z(z)  is  an  arbitrary  stable  matrix.  Finally,  substitu¬ 
tion  of  this  w(s)  into  1.10  yields  the  required  compensator 
while  the  input/output  gain  for  the  resultant  feedback 
system  takes  the  form 


Fig.  3.  Periodically  varying  feedback  system. 


the  ring  G,  composed  of  2  by  2  matrices  of  rational 
functions  which  have  the  form 


P^+zP^z2) 

P^+zP^z2} 


P3(z*)-zP4{z*) 

P^-zP^z2) 

(6.40) 


L 


2(r2  +  l)(r  +  3)(r-  l)1 
(r+1)3 

2(r2  +  l)(r-f  3) 


(r+1) 

0 


(r+1)2 

1 

(*+l) 


z2l(S) 


*a(*) 

*n(*) 


(*2  +  0 
(r+1)2 

0 


0 

(r2  +  l)(r— 1) 
(r+l)J 


(6J9) 


Note  that  the  factor  (r2  +  IX  in  the  numerator  of  A,^(r) 
implies  that  the  feedback  system  will  reject  the  required 
sinusoid,  while  choosing  z2l(r)»0  will  preserve  the  desira¬ 
ble  triangular  nature  of  the  plant 

C  A  Periodically  Varying  Discrete-Time  Tracking  Problem 


where  P„  i—  1, 2,  3, 4,  are  rational  functions.  As  usual,  the 
stable  systems  H  are  taken  to  be  the  subring  of  G  com¬ 
posed  of  functions  which  are  analytic  inside  the  unit  circle 
of  the  complex  plane.  All  of  the  constituent  components 
of  a  discrete-time  system  of  period  2  can  be  modeled  by 
such  matrices  and  the  usual  operational  calculus  for  inter¬ 
connected  systems  remains  valid  [5],  [8]-[lIJ.  For  in¬ 
stance,  a  constant  scale  factor  with  gain  k  is  modeled  by 
the  matrix 


P{z)- 


k 

.0 


0 

k. 


(6.41) 


while  a  single-variate  time-invariant  component  is  mod¬ 
eled  by 


P(z) 


n z) 
0 


0 

n*). 


(6.42) 


where  7T (r)  is  the  usual  transfer  function  for  the  compo¬ 
nent  and  7T(z).  —  I\—z).  Finally,  the  periodically  varying 
multiplier  defined  by 


is  modeled  by 


'<«>-[?  ii 


(6.43) 

(6.44) 


Although  time-varying  systems  are  not  traditionally 
viewed  as  multivariate,  the  class  of  periodically  varying 
discrete  time  systems  admits  a  frequency  domain  theory 
which  closely  resembles  the  classical  multivariate  theory. 
In  fact,  we  can  apply  the  results  of  Section  V  to  this  class 
of  time-varying  systems.  Although  it  is  not  well  known  in 
this  system  theory  community,  the  frequency  domain  the¬ 
ory  for  periodically  varying  discrete-time  systems  has  been 
rediscovered  by  a  number  of  researchers  over  the  past 
quarter  century  in  one  form  or  another  [5],  (8]-{ll).  The 
basic  theory,  however,  always  employs  an  a  by  a  matrix  of 
z-transforms  to  model  a  single-variate  system  of  period  n. 
For  the  present  example  we  will  take  n«  2  in  which  case  a 
single-variate  system  is  modeled  by  a  transfer  function  in 


Now,  consider  the  periodically  varying  feedback  system  of 
Fig.  3.  Here,  the  plant  is  a  time-invariant  system  with  a 
parallel  time-varying  gain  and  it  is  thus  modeled  by  the 
transfer  function  matrix 


F(z)- 


(2+z) 

1 


1 


(2-z) 


(6.45) 


Since  this  system  is  stable  (its  poles  are  located  at  z—  ±2), 
we  may  obtain  a  doubly  coprime  representation  for  P(z) 
by  letting  p(z)-/»(r),  /(z)-l,  q(z)- 0,  and  #(z)-l. 
Since  these  matrices  are  all  mutually  commutative  p(z) 
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may  serve  for  both  p,{z)  and  p,(z)  and  similarly  for  the 
remaining  p’s  and  q' s. 

Let  us  now  consider  the  problem  oi  designing  a  com¬ 
pensator  for  this  plant  which  will  cause  it  to  track  a 
prescribed  input  without  destabilizing  the  system.  For  this 
purpose  we  take 


(l-z1)1 


o-*3)2 


which  generates  an  input  which  oscillates  between  zero 
and  a  ramp.  Here  T(z)  is  unstable  (since  it  has  poles  at 
±  1)  but  its  inverse  is  stable.  Therefore,  we  may  form  a 
doubly  coprime  fractional  representation  for  T(z)  with 


'"(<)- 


(l-*3)2 


(l-*3)2. 


j (z)- 1,  and  f(r)«0.  Now 


(2+r) 


(2-*) 


z(l5—6z2) 
4(4— z2)(2— z) 
(13-6z2) 
4(4— z2) 


0  -z2f 


(15— 6z2) 
4(4— z2) 
-z(13— 6z2) 
4(4— zJ)(2+z) 

1 

(4— z2)2 


(1-83)2 


(4— z2)3 


mP(*)Ji(z)+‘(*)h(z)m  1 


£(z)-/(z)r'(z)-- 


(1-z2)2 


0-z2)3 


Since  E(z)  coincides  with  7\z)  we  may  let  e(z)«f(z), 
e(z)«r(z), /(z)»j(z),  and  /(z)-i(z)  define  our  doubly 
coprime  fractional  representation  for  E(  1.  Substituting 
into  (5.16)  we  obtain 


w(z)-- 


z(15-6z2) 
4(4— z2)(2— z) 
(15-6z2) 
4(4— z2) 


(15— 6z2) 
4(4— z2) 
-z(15-6z2) 
4(4— z2)(2+z) 


o,(z2)+zo,(z2)  o,(z2)-zo4(z2) 

o3(z2)+zc4(z2)  o,(z2)-zo2(z2) 

(1-z2)2  0 

f 

0  (1-z2)2 


which  defines  all  compensators  which  satisfy  the  con¬ 
straints  of  problem  ST  where  the  o,  are  arbitrary  stable 
rational  functions.  To  construct  an.  example  of  a  com¬ 
pensator  we  take  c,  —  0,  <*1,2,3, 4,  which  yields  a  w(z), 
which  is  just  the  first  term  in  (6.50).  Substituting  into 
(1. 10)  we  obtain  the  compensator 


C(z)- 


(2— z) 


(2+z) 


(15— 6z2)(4— z2) 
4(1 -z2)3 


Interestingly,  this  time-varying  compensator  perfectly 
cancels  the  time-variation  in  the  plant  yielding  a  time- 
invariant  open-loop  gain  and  time-invariant  feedback  sys¬ 
tem  gains  in  the  form 


and  hence  p(z)  and  f(z)  are  coprime,  verifying  the  ex¬ 
istence  of  a  solution  to  our  tracking  problem.  The  final 
step  required  to  obtain  the  desired  compensator  is  to 
compute 


(13— 6z2) 
(4— z2)3 


(15-6z2) 
(4— z2)2 


o 

Mz)-  .  •  («•»> 

o  ^ 

(4— z2)3  . 

Here,  the  (1-z2)2  lector  in  A,p(z)  indicates  that  the 
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Fig.  4.  Periodically  varying  feedback  system  that  tracks  1/(1  —z2)2. 


required  tracking  property  is  attained  while  the  system 
gains  are  clearly  stable.  The  resultant  feedback  system  is 
sketched  in  Fig.  4.  Although  it  is  not  clear  from  the  figure 
that  the  open  loop  gain  for  this  system  is  time-invariant, 
this  fact  can  be  verified  by  computing  P(z)C(s)  and 
showing  that  it  has  the  form  of  (6.42).  Alternatively,  an 
opportune  rearrangement  of  the  subsystems  in  Fig.  4  will 
lead  to  an  equivalent  time-invariant  system. 

VII.  Conclusion 

Although  the  above  derivations  were  occasionally  com¬ 
plex,  it  is  important  to  note  that  the  mathematics  em¬ 
ployed  was  quite  elementary.  Indeed,  at  no  time  have  we 
used  any  mathematical  techniques  which  are  more 
sophisticated  than  addition,  multiplication,  subtraction, 
and  inversion.  Moreover,  a  single  proof  technique  has 
been  employed  throughout  the  present  paper  as  well  as  in 
[6],  First,  one  formulates  a  design  equation  which  is  linear 
and  characterized  by  two  unknowns  in  the  ring  of  stable 
systems.  A  particular  solution  for  this  equation  is  con¬ 
structed  in  terms  of  a  specified  coprimeness  condition.  A 
homogeneous  solution  is  formulated  in  terms  of  an  arbi¬ 
trary  stable  design  parameter.  Finally,  a  coprimeness  con¬ 
dition  is  used  to  verify  that  all  homogeneous  solutions 
have  been  obtained  and  the  desired  parameterization  of 
the  solution  space  is  formulated  in  terms  of  the  specified 
particular  and  homogeneous  solutions.  We  believe  that 
this  algebraic  formulation  and  solution  technique  is 
fundamental  to  the  feedback  system  design  problem  and 
we  are  presently  investigating  several  additional  applica¬ 
tions  thereof. 

Although  our  design  equations,  formulated  in  Theorem 
2,  are  applicable  to  arbitrary  linear  systems,  the  solutions, 
formulated  m_  Section  VI,  are  restricted  to  multivariate 
systems  with  t  and  f  in  L.  An  inspection  of  the  proof  of 
Theorem  IV  will,  however,  reveal  that  the  multivariate 
assumption  was  not  used  in  the  proof  of  the  existence  of  a 
solution  to  problem  SR  (although  it  was  used  in  the 
parameterization  of  the  solution  space  and  in  the  deriva¬ 
tions  of  the  solutions  to  problem  ST  and  STR).  We  thus 
have  the  following. 

Theorem  6:  For  the  feedback  system  of  Fig.  1  let  P  and 
R  be  characterized  by  doubly  coprime  fractional  represen¬ 
tations.  Then  problem  SR  admits  a  solution  if,  and  only  if. 
Pi  and  r,  are  right  coprime.  (No  commutativity  assumption 
is  necessary.) 
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The  main  problem  of  adaptive  control  theory  is  to 
design  a  system  S  which  Is  capable  of  automatically 
adjusting  the  generated  control  Input  to  the  plant  P. 
Such  adjustments  may  be  necessary  for  a  variety  of 
reasons,  such  as  Insufficient  knowledge  about  the 
plant,  plant  perturbations,  etc.  A  multitude  of  adap¬ 
tive  control  techniques  have  been  proposed  through  the 
years.  A  characteristic  shared  by  all  of  them  is  the 
presence  of  some  means  of  Identifying  the  unknom  or 
perturbed  plant.  Of  course,  the  design  of  such  a 
mechanism,  termed  here  the  identifier.  Is  an  important 
question  in  its  own  right.  The  design,  however,  of  an 
adaptive  controller  is  heavily  Influenced  by  the  par¬ 
ticular  technique  used  to  generate  the  control  and  it 
therefore  Inherits  the  technique's  features. 

A  recent  advance  in  control  theory  Is  an  approach 
to  feedback  control  based  upon  the  representation  of 
the  plant  as  the  ratio  of  two  operators,  both  of  which 
belong  to  an  operator  ring  H.  (Ref).  A  brief  Overview 
of  the  approach  is  as  follows.  Consider  the  following 
ring  structure  R 

R  -  (G.H.I.JI  (1.1) 

where  C  is  a  not  necessarily  commutative  ring  with 
identity  representing  the  general  class  of  systems  of 
Interest.  The  subring  H  also  contains  the  identity 
and  represents  the  class  of  systems  which  in  some  sense 
are  stable.  I  is  the  set  of  elements  in  H  which  admit 
an  inverse  in  G  and  J  the  set  of  elements  in  H  which 
admit  an  inverse  in  H.  As  shown  In  (Ref), 

Gp  H  □  I  3  J  (1.2) 

A  plant  P  is  said  to  have  a  doubly  coprime 
fractional  representation  if  for 
(Nr.N1.Ur,U1.Vr.V1)c  H  and  (D^lc  T 


(1.3) 

“rWr"1 

(1.4) 

N1U1+D1V1  *1 

(1.5) 

The  aim  now  is  to  design  a  system  S  so  that  the 
system's  inout-output  map  h  is  placed  in  H.  Consider 
the  system  shown  in  Fig.  1.1  and  assume  that  P  has  a 
doubly  coprime  fractional  representation. 


x—  .  ■"  «  C 


Fig.  1.1.  A  feedback  control  system. 
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For  any  arbitrary  w,  let  the  compensator  C  be 
defined  at 

C-(wN1+¥p)',(-wDi*Up).  (1.6) 

It  was  shown  that  If  wcH,  then  the  input-output  map  h 
also  belongs  to  H  and 

h*Np(-wD^+Ur) .  (1.7) 

An  important  element  of  the  approach  Is  that  It 
provides  a  complete  characterization  of  the  set  of 
compensators  which  place  h  in  the  ring  H.  It  is  there¬ 
fore  desirable  to  investigate  the  conditions  under 
which  fractionally  represented  feedback  systems  can  be 
adaptively  controlled. 

Suppose  then  that  either  In  the  limit  as  t— ,  or 
for  all  times  t  i  tQ,  an  input-output  map  H  in  H  Is 
desired;  in  other  words,  suppose  that,  with  the  appro¬ 
priate  time  interpretation,  it  is  required  that 

h-H.  (1.8) 

Clearly,  there  exists  a  choice  of  three  independent 
variables,  namely  w,  U  and  V  ,  to  satisfy  two  linear 
equations.  The  decisibn  was  fiade  to  consider  w  as  a 
parameter  in  H.  Thus  the  problem  can  in  general  be 
stated  as  seeking  the  particular  coprimeness  operator 
pair  U_,V  which  for  a  given  w  in  H  simultaneously 
satisfies  tq.s  1.4  and  1.8. 

The  two  main  problems  to  be  addressed  here  are  tne 
acquisition  and  the  plant-follower.  In  the  former, 
the  linear,  time- invariant  plant  P  is  assumed  to  be 
insufficiently  specified  at  the  initial  time  tg.  The 
intention  is  to  provide  a  feedback  system  S  which 
consists  of  an  identifier  IJ2  and  an  adaptor  AD  as 
shown  in  Fig.  1.2.  The  identifier  provides  the  adaptor 
with  estimates  p(t)  of  the  plant  P  such  that 
lim  p(t)  •  P.  Then,  using  these  estimates,  the  adaptor 


Fig.  1.?.  An  adaptive  control  system. 

provides  the  compensator  with  an  operator  pair 
(ur(t),vr(t)l  such  that  the  required  coprimeness  pair 
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(Ur,V_)  Is  obtained  In  the  Halt.  The  first  task  Is 
to  delineate  the  class  of  plants  for  which  such  a 
systw  is  possible.  This  can  be  done  by  deriving  the 
necessary  and  sufficient  conditions  for  a  solution  to 
exist  under  the  assumption  of  Instantaneous  identifi¬ 
cation,  (i.e.,  a  perfect  identifier).  Then  it  would 
reaain  to  show  that  in  the  non- Ideal  case  the  solution 
can  be  attained  adaptively.  In  other  words  it  would 
be  required  that  Eq.s  1.4  and  1.8  are  satisfied  in 
the  limit. 

In  the  plant-follower  problem  the  linear  plant  P 
is  perfectly  known  at  the  initial  time  t<j,  but  it 
undergoes  perturbations  thereafter.  The  intention  is 
to  provide  the  compensator  with  an  operator  pair 
Ulr(t),Vr(t)}  such  that  the  systems  Input-output  map 
remains  invariant  under  .the  plant's  perturbations. 

In  other  words  Eq.s  1.4 'and  1.3  are  to  be  satisfied  at 
every  point  in  time.  Again  the  class  of  plants  for 
which  a  solution  exists  is  delineated  under  the  per¬ 
fect  identifier  assumption^-  In  the  non-ideal  case  it 
is  desirable  to  examine  the  extent  to  which  the  input- 
output  map  is  perturbed  due  to  the  plant  perturbation. 

As  always,  stability  is  a  question  of  paramount 
importance.  A  consequence  of  the  fractional  represen¬ 
tation  approach  is  the  fact  that  a  system  is  stable 
in  the  sense  of  H  whenever  the  system's  input-output 
map  is  time-invariant  and  the  coprimeness  operators 
belong  to  H.  This  is  exploited  in  the  Ideal  case  of 
both  problems.  But,  whereas,  in  the  acquisition  prob¬ 
lem  the  derived  stability  conditions  are  time-inde¬ 
pendent  and  hence  easy  to  rheck  a  priori,  in  the 
plant- follower  they  are  time-dependent  and  thus  the 
task  of  verifying  whether  they  hold  or  not  is  consid- 
erabley  harder.  However,  the  problem  is  by-passed  by 
showing  that  in  this  ca$e  the  question  of  the  coprime¬ 
ness  operators  belonging  to  H  is  equivalent  to  the 
classical  question  of  stability  in  the  sense  of  H  of  a 
system  with  time-invariant  feedforward  path  and  memory- 
less,  tine-varying  feedback  path.  In  the  adaptive 
case  of  the  pi ant-fot lower  problem  stability  is  re¬ 
solved  by  a  similar  criterion  applied  to  the  entire 
adaptive  acquisition  problem,  the  fact  that  the  input- 
output  map  converges  to  a  time- invariant  element  of  W 
suggests  that  the  system  is  stable  as  long  as  the  map 
remains  bounded.  It  is  shown  that  for  uniform  asymp¬ 
totic  stability  this  is  in  fact  the  case  as  long  as  a 
sufficiently  “good"  identifier  is  used.  (The  quality 
of  the  Identifier  is  also  shown  to  be  related  to  the 
robustness  of  the  adaptive  plant-follower  system). 

The  requirement  to  control  the  entire  input- 
output  map  restricts  the  application  to  a  class  of 
plants  which,  for  all  practical  purposes,  is  only 
slightly  larger  than  the  miniphase  case.  But  if  a 
less  restrictive  requirement  is  imposed  the  class  be¬ 
comes  considerably  larger.  The  point  is  demonstrated 
by  the  pole  positioning  problem  for  plants  represented 
as  rational  functions  (not  necessarily  proper).  It 
is  shown  that  the  problem  is  equivalent  to  solving  a 
linear,  algebraic  equation.  Furthermore,  a  solution 
to  the  equation  is  shown  to  exist  provided  that  the 
number  of  poles  to  be  positioned  is  sufficiently  large. 
In  terms  of  adaptive  control,  the  equation  must  be 
solved  repeatedly  in  time  by  any  of  the  available 
methods,  (e.g.  a  continuation  algorithm). 
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Abstract 


A  new  approach  to  the  design  of 
suboptiraal  controllers  for  constrained, 
nonlinear,  decentralized,  and  non-quad¬ 
ratic  systems  is  presented.  Here,  one 
designs  a  quadratic  regulator  for  an 
idealized  system  but  chooses  the  weight¬ 
ing  matrices  for  the  regulator  to  opti¬ 
mize  its  performance  as  a  controller  of 
the  actual  system  relative  to  a  pre¬ 
scribed  (not  necessarily  quadratic)  per¬ 
formance  measure.  The  approach  is  illus¬ 
trated  via  several  examples. 

I.  Introduction 

Historically,  the  control  system  de¬ 
signer  has  been  faced  with  the  dilemma: 
"Should  he  work  with  an  idealized  model 
of  a  system  which  is  amenable  to  simple 
solutions  or  a  ’real  world’  model  which 
say  be  intractable?’  The  former  approach 
is  epitomized  by  the  LQG  school  wherein 
a  highly  idealized  model  of  a  real  world 
system  yields  an  easily  implementable 
analytic  control  theory  (11,12).  Alter¬ 
natively,  a  more  realistic  model  may  be 
employed  in  conjunction  with  a  nonlinear 
programming  algorithm  at  the  cost  of  a 
more  complex  implementation  and  increased 
computer  requirements  (2,3). 

The  purpose  of  the  present  paper  is 
to  describe  an  intermediate  approach  to 
the  control  system  design  problem, 

(4,5,13)  wherein  one  designs  a  quadratic 
regulator  for  an  idealized  system  but 
chooses  the  weighting  matrices  for  the 
regulator  to  optimize  its  performance  as 
a  controller  of  the  actual  system  rela¬ 
tive  to  a  prescribed  (not  necessarily 
quadratic)  performance  measure.  The  ad¬ 
vantage  of  such  an  approach  is  that  the 
resultant  regulator  has  the  same  "ease 
of  implementation’  and  most  of  the  "sta¬ 
bility  characteristics"  associated  with 
the  classical  LQG  problem.  The  disadvan¬ 
tage  is  that  the  system  performance  is 
suboptimal.  Computationally,  the  process 
does  not  require  any  more  effort  than 
required  for  a  nonlinear  optimization. 

•This  research  supported  in  part  by  the 
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Moreover,  the  on-line  computation  needed 
to  implement  the  system  is  greatly  re¬ 
duced  from  that  which  would  be  required 
for  an  optimal  non-linqar  controller. 

The  basic  steps  required  for  our 
proposed  design  procedure  are: 

i.  Approximate  the  given  (nonlinear, 
interconnected,  or  constrained) 
system  by  a  linear  state  model. 

ii.  Design  a  regulator  for  the  approxi¬ 
mate  state  model  which  minimizes  a 
quadratic  performance  measure  with 
weighting  matrices  Q  and  R. 

iii'.  Evaluate  the  performance  of  the 

actual  system  using  the  above  quad¬ 
ratic  control  strategy. 

iv.  Optimize  the  performance  of  the 

actual  system  under  such  a  control 
strategy  as  a  function  of  the 
weighting  matrices  Q  and  R. 

As  is  the  case  with  any  "real  world* 
design  algorithm,  its  effectiveness  can 
be  measured  only  by  its  performance  in 
engineering  practice.  As  such,  the  re¬ 
mainder  of  the  paper  is  devoted  to  a 
series  of  examples  in  which  the  above- 
described  design  procedure  is  applied  in 
a  variety  of  settings  and  compared  with 
existing  design  procedures.  The  examples 
include  three  tracking  systems,  a  case  of 
control  under  input  constraints  and  a 
case  of  decentralized  control. 

II.  Tracking  Systems 

To  illustrate  the  design  procedure 
we  begin  with  a  simple  first  order 
tracking  problem.  Here  the  system  dynam¬ 
ics  are 

x(t)  •  -  x(t)  ♦  u( t)  ;  x  (t0)  •  x0  2.1 

The  optimal  input  u*(t)  minimizing 
the  quadratic  index 


Cqx2(t)  ♦  u2(t)]  <Jt 

2.2 

-  -P(q,t)  x  (t) 

2.3 
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where  P(q,t)  is  the  Riccati  equation 
solution.  Hence  the  optimal  state  tra¬ 
jectory  is  given  by 


He  now  seek  the  weight  q*  which 
minimizes  a  given  performance  index  Jg. 

2.5 

S*2*_Ls  J£  -  .£  ♦  Cu*(t)]2)  it' 

The  trajectories  required  by  J£ 
and  typical  regulator  trajectories 
are  shown  in  Fig.  2-1.  Obviously  q* 
depends  upon  xQ.  On  the  other  hand, 

finding  q*  given  an  xQ  is  not  always 
trivial  (refer  to  Fig.  2-2) . 


fl|.  2-1  l«»M  a>W  Ty»<«al  lifymn  -Uu  1 


the  state  trajectory  approximates  its 
requirement  better  and  better.  Thus  the 
regulator  is  "instructed”  to  emphasize 
the  input  cost  by  decreasing  q*  which 
then  causes  the  input  to  decrease  and 
thus  approximate  its  requirement  better. 
Then,  since  a  scalar  stable  system  with 
a  quadratic  regulator  is  always  bounded 
by  its  natural  response,  which  in  turn 
is  always  bounded  by  the  initial  condi¬ 
tion,  the  best  approximation  to  the  x*l 
requirement  that  the  system  could  ever 
achieve  for  0<x^<l  is  i  its  own  natural 

response.  Thus  for  this  range  q*"0. 

For  x  -0  q*  is  indeterminate  since  both 
o 

state  and  input  are  identically  zero. 

Once  xq  becomes  slightly  negative,  the 

regulator  is  instructed  to  drive  the 
state  to  zero  as  fast  as  possible  be¬ 
cause  the  state  can  now  only  add  to  the 
error.  However,  as  becomes  more  and 

more  negative,  the  effort  in  driving  the 
state  to  zero  becomes  significant  also 
and  hence  q*  approaches  unity.  Table  2-1 
shows  that  the  present  approach  holds  an 
advantage  over  the  competitive  approach 
with  an  optimally  chosen  constant  gain 
h. 

TikU  2-1  TtM-ImHut  »«n is  Tlw-nrimt  Opthal  Dal|«— Cm  1 


As  x  -+ +oc  ,  q*->  +1  because  for  x„ 
o  -  2  2  ° 

being  very  large,  (x(t)-l)  ~x  (t)  and 
thus  equal  importance  is  placed  upon 
the  state  and  the  input  term  of  .  As 
x  goes  to  zero  from  above. 


rt|.  2-1  if  I*  wfV  <t.  1 


Case  2 i 

Required  and  typical  trajectories 
are  shown  in  Fig.  2-3  whereas  the  depen¬ 
dence  of  q*  upon  x  is  shown  in  Fig.  2-4. 
Once  again,  to  benchmark  our  results. 
Table  2-2  compares  the  present  approach 
to  optimally  choosing  a  time  invariant 
gain  h.  A  noted  improvement  is  obtained 
for  x0“l*  the  advantage  becoming  less 

profound  as  xQ  gets  larger. 

3.  •  Z1  (Cl  -  t  -  x*(t)j2  ♦  C«J*(t)]2>  dt 

*■  1 
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JjttJ 

with  x2(0)-o  and  x^(0)  variable.  Our 

goal  is  to  design  a  regulator  for  the 
index 

20  t  . 

JS  “  l  C*  (t)Qx(t)  ♦  u2(t»  dt  2  a 

so  that  the  measure  ^ 
to 

J,  •  j  (|ij(t)  -  o.oitl  *  o.ooi|u(t)|)  at  ♦ 
o 

♦  (|*j(t)  ♦  o.oit  -  o.*|  ♦  0.001  |«(t)|»  at  -  2.9 

is  minimized  over  the  diagonal  matrix 
Q.  The  optimal  designs  are  plotted 
against  x^O)  in  Fig.  2-6.  optimal 

trajectories  of  x2(t)  . 

■*« 
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As  a  variation  on  the  tracking  prob¬ 
lem,  consider  the  design  of  a  pulse 
forming  system  whose  output  is  required 
to  approximate  a  triangular  pulse.  Speci¬ 
fically  assume  that  x^t)  is  to  approxi¬ 
mate  the  pulse  shown  in  Fig.  2-5  where 
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are  shown  in  Pig.  2-7.  it  is  perhaps  of 
interest  to  note  that  as  x^oj-^co  ,  the 

control  is  dominated  by  causing 
Xj  (t)  to  be  regulated  through  its  deriv¬ 
ative. 


III.  An  Aircraft  Landing  Problem 

A  four  state  model 

xCt)-Fx(t)  +  Gu (t)  3*1 

.for  the  landing  of  an  aircraft  was  pre¬ 
sented  in  (1,2)  and  was  used  in  (3)  to 
treat  the  same  problem  with  dynamic  pro¬ 
gramming.  Although  somewhat  simplified, 
the  model  is  adequate  for  illustrating 
our  technique. 

' The  states  are  defined  in  terms  of 
the  aircraft's  coordinates  and  angles  as 
(refer  to  Fig.  3-1) . 

x(t)«(8(t) ,9(t) ,h(t) ,h(t) ) ,  3.2 

whereas  the  input  u(t)  is  the  elevator 
deflection  angle  6e ( t)  which  is  mechan¬ 
ically  constrained  between  -35°  and  15°. 
Consistent  with  our  suboptimal  design 
approach  we  will  initially  neglect  this 
limiting  effect  and  design  a  quadratic 
regulator  for  the  linear  system  but  we 
will  choose  the  weighting  matrices  so 
that  the  behavior  of  the  actual  nonlinear 
system  is  optimal  relative  to  a  perfor¬ 
mance  index  J  .  This  index  is  chosen  to 
simultaneously  achieve  a  safe  and  com¬ 
fortable  landing.  The  complexity  of  the 
resulting  J  is  such  that  rather  than 
giving  an  explicit  description  here  we 
will  only  list  some  of  the  factors  en¬ 
tering  its  formulation.  The  interested 
reader  is  referred  to  (4,5)  for  the 
details. 


7777777///  7/  77777/ >'777777777; 
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Safety  Conditions 

(1)  The  angle  of  attack  3((t)  must  be 
less  than  about  18°  and  not  considerably 
negative. 

(2)  At  touchdown  the  airplane  must  be  on 

the  runway  and  within  a  distance  d  from 
the  runway's  start.  3 

(3)  At  touchdown  the  rate  of  descent 

h(T)  must  be  between  -1.75  ft/sec  and 
0.0  ft/sec.  * 

(4)  At  touchdown  the  pitch  angle  e(T) 
must  be  within  0°  and  10°. 

Comfort  Conditions 


(1)  Avoid  all  accelerations. 

(2)  Avoid  a  "hard”  landing. 

(3)  Avoid  a  negative  pitch  angle. 

The  underlying  assumption  is  that 
safety  takes  precedence  over  comfort  and 
thus  should  any  of  the  safety  conditions 
be  violated,  J_»CO.  It  is  noted  that  the 
selection  of  a2, matrix  Q  to  satisfy  these 
conditions  is  not  a  trivial  task. 

It  was  shown  in  (4)  that  stability 
considerations  for  the  nonlinear  model 
lead  to  the  establishment  of  an  upper 
bound  for  Q.  In  particular,  if  F  is  in¬ 
vertible  and  P (Q, t)  the  Riceati  equation 
solution,  the  upper  bound  on  Q  is  estab¬ 
lished  by 

R-GTP(Q)F-1G  3.3 

where  P(Q)*lim  P(Q,t).  The  landing  was 
simulated  on  an  IBM-370  and  for  an 
initial  state 


x(0)-  (0.0, -0.0181, -20. 0,100.0)  3*« 
the  optimal  matrix  Q  was  found  to  be 

3*5 

diag'fc*-  (21. 0,21. 0,0. 0016, 0.00047). 
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The  resultant  optimal  input  and 
elevation  trajectories  are  shown  in 
Figs.  3-2  and  3-3. 


The  dimension  of  a  large  dynamical 
system  is  often  so  large  that  control  on 
a  global  scale  is  prohibitive  due  to  ex¬ 
cessive  requirements  on  computer  storage 
or  time.  In  such  cases  it  may  be  desir¬ 
able  to  consider  the  system  as  a  collec¬ 
tion  of  subsystems  S^, i»l,2 , . . . ,N  and 

control  each  subsystem  separately.  Sev¬ 
eral  schemes  have  been  suggested  varying 
from  ignoring  the  subsystem  interconnec¬ 
tions  to  providing  a  separate  control  to 
neutralize  their  effect  (6.7,8,9,10) . 

As  an  alternative  we  adopt  our  sub- 
optimal  approach  to  control  system  de¬ 
sign.  Here  the  given  interconnected 
system  is  initially  approximated  by  a 
decoupled  system  for  which  decoupled  qua¬ 
dratic  regulators  are  designed.  These 
regulators  are  then  employed  to  control 
the  coupled  system  with  the  weighting 
matrices  being  chosen  to  optimize  the 
performance  of  the  resultant  coupled 
system  relative  to  some  performance  mea¬ 
sure  J£. 


To  illustrate  the  approach  we  con¬ 
sider  the  system 

x(t)  «  Fx(t)  +  Su(t)  «■  Hx(t);  x(tQ)  «  x0  4.1 

where  F  and  G  are  block  diagonal  matrices 
characterizing  the  decoupled  system  com¬ 
ponents  and  H  is  a  coupling  matrix. 


For  our  example  we  take 
'“-0.10  0.05  J 
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We  now  approximate  the  given  system 
by  the  decoupled  system 


x<t)  -  Fx(t)  ♦  Gu(t) 


4.5 


and  design  a  decoupled  regulator  which 
minimizes 

JS  “  f  Cxt(t5Qx(tl  *  “t(t)u(t)3  dt.  4.6 

where  Q  is  a  block  diagonal  matrix. 
This  regulator  is  then  used  to  control 
the  coupled  system  of  Eq.  4-1  with  Q 
chosen  to  minimize  the  global  perfor¬ 
mance  measure 

10 

J-  -  j  Cx*(t)  x(t)  ♦  u’(t)  u(t)]  dt  4.7 


to  be 
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It  is  interesting  to  compare  the 
above  described  approach  to  the  so- 
called  "naive"  scheme  where  the  couplings 
between  subsystems  are  totally  ignored. 
(To  the  contrary,  we  design  decoupled 
regulators  for  each  subsystem  but  choose 
the  weight  matrices  through  a  minimiza¬ 
tion  of  J  where  all  the  couplings  are 
included) .  In  this  particular  example 
the  naive  approach  yields  a  J  =*6.590, 
whereas  our  algorithm  yields  J  *6.349, 
a  3.8  percent  improvement. 

V.  Conclusions 

Our  purpose  in  the  preceding  has 
been  the  formulation  of  a  new  approach 
to  the  suboptimal  control  of  nonlinear, 
constrained,  decentralized  and  non-qua¬ 
dratic  systems.  In  essence  we  restrict 
ourselves  to  a  subclass  of  controllers 
by  assuming  that  our  controller  will  take 
the  form  of  a  quadratic  regulator  for  a 
linear  approximation  of  the  given  system. 
We  then  choose  a  particular  regulator  so 
as  to  optimize  the  performance  of  the 
given  system  within  our  class.  The  ad¬ 
vantage  of  the  approach  is  the  relative 
ease  in  implementing  the  resultant  con¬ 
trol  strategy  and  the  stability  inherent 
in  the  use  of  a  quadratic  regulator.  The 
disadvantage  is  the  suboptimality  of  the 
result  and  the  computational  effort  re¬ 
quired  to  choose  the  optimal  weighting 
matrices. 
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ABSTRACT 

An  exollclt  relationship  between  the  fraction¬ 
al  representation  approach  to  feedback  systen  de¬ 
sign  and  the  algebro-qeometric  approach  to  systen 
theory  Is  formulated  and  used  to  derive  a  global 
solution  to  the  feedback  system  design  problem. 
These  techniques  are  then  applied  to  the  simul¬ 
taneous  stabilization  problem  yielding  a  natural 
geometric  criterion  for  a  set  of  plants  to  be  sim¬ 
ultaneously  stabilized  by  a  single  compensator. 

I.  SUTtttRY 

Classically,  In  control  theory  one  is  given  a 
plant  and  desires  to  design  a  control  system 
around  this  plant  which  meets  certain  design 
specifications.  In  fact,  however,  a  'real  world* 
plant  Is  never  known  exactly  and,  as  such,  a  rea¬ 
listic  design  must  simultaneously  meet  specifica¬ 
tions  over  an  entire  range  of  plants  which 
(hopefully)  Include  the  actual  plant.  The  simplest 
form  of  the  resultant  MuiXantotu  cfoUgn  pnoblun 
Is  the  aobo tf.  d&Ugn  ptoblim  wherein  one  desires 
to  meet  the  design  specifications  In  an  e-ball 
around  a  prescribed  nominal  plant.  Although  this 
is  satisfactory  for  dealing  with  modeling  errors 
It  cannot  cope  with  plants  containing  unknown 
parameters  and/or  plants  characterized  by  multiple 
modes  of  operation.  For  Instance,  the  dynamics  of 
an  airplane  or  rocket  vary  widely  with  altitude 
while  the  dynamics  of  an  electric  motor  change 
with  speed  and  load.  To  cope  with  these  problems 
we  must  formulate  a  limUbtonto ui  dcs-ign  theory  in 
which  one  designs  a  control  system  to  simultaneous¬ 
ly  meet  specifications  over  a  prescribed  set  of 
plants.  Of  course,  the  set  of  plants  may  be  taken 
to  be  a  ball  In  which  case  the  classical  robustness 
theory  Is  replicated.  Alternatively,  one  may 
choose  to  work  with  a  set  of  plants  In  which  one 
or  more  parameters  vary  over  a  prescribed  range 
and/or  a  discrete  set  of  plants;  say  the  dynamics 
of  a  two  speed  motor  In  its  high  and  low  speed 
settings. 

The  simultaneous  design  concept  Is  possibly 
best  Illustrated  In  the  1st  order  case,  wherein  a 
simple  geometric  solution  suggests  Itself.  Assume 
that  our  plants  are  of  the  form 

P(S)  -  ^4-ff  0-1) 

and  we  desire  to  design  a  stable  feedback  system 


1 

using  a  proportional  compensator  with  gain  t.  This 
results  in  a  system  with  characteristic  function 

d(s)  ■  s  «•  (B  ♦  tA)  (1.2) 

and,  as  such,  the  feedback  system  will  be  stable 
if  and  only  If  B  +  tA  >  0.  Here,  for  a  given 
compensator,  t,  the  feedback  system  will  be  stable 
If  and  only  If  the  point  (A.B)  lies  above  the  line 
with  slope  1/t  as  shown  In  figure  la.  As  such.  If 
we  want  to  simultaneously  stabilize  an  entire  set 
of  plants  their  representations  on  the  A-B  plane 
must  all  lie  above  a  line  through  the  origin.  For 
instance,  the  set  of  plants  Indicated  by  the  hatch¬ 
ed  region  in  figure  lb.  can  be  simultaneously 

stabilized'  (by  a  compensator  with  gain  -  Jr)  while 

the  set  of  plants  shown  in  figure  lc.  cannot  be 
simultaneously  stabilized  since  they  subtend  an 
angle  greater  than  180  degrees  on  the  A-B  plane. 
Similarly,  the  set  of  plants  shown  In  figure  Id. 
cannot  be  simultaneously  stabilized  since  they 
cross  the  negative  A-axIs. 

The  example  suggests  two  alternative  criteria 
for  the  simultaneous  stabilization  problem.  One 
may  adopt  an  algebraic  criterion  to  the  effect 
that 

B  ♦  tA  »  0  (1.3) 

for  each  plant  In  the  prescribed  set  and  soma  t. 
While  such  a  test  Is  deflnatlve  It  is  local  in 
nature  allowing  one  to  test  for  stablllzablllty  on 
a  plant  by  plant  basis  but  yielding  no  global 
criterion  with  which  to  characterize  a  set  of 
plants  which  Is  simultaneously  stablllzable.  To 
the  contrary  one  may  adopt  a  global  geometric  view¬ 
point  to  the  effect  that  a  prescribed  set  of 
plants  Is  simultaneously  staballzable  if  and  only 
If  It  Is  contained  In  an  appropriate  half-plane. 

The  goal  of  the  present  paper  Is  the  formulation 
of  a  similar  geometric  criterion  for  the  simul¬ 
taneous  stabilization  problem  applicable  to  general 
linear  systems. 

The  starting  point  for  our  theory  Is  the  ring 
theoretic  fractional  representation  theory  Intro¬ 
duced  by  the  authors  In  a  series  of  recent  papers 
Jn  which  the  set  of  stabilizing  compensators  for  a 
given  plant  are  parameterized. Indeed,  with 
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minor  modifications  one  can  invoke  the  same  theory 
to  parameterize  the  set  of  plants  which  are  stabi¬ 
lized  by  a  given  compensator.  This,  in  turn, 
yields  an  inwdiate  algebraic  criterion  for  the 
simultaneous  stabilization  problem.  The  resultant 
criterion  is,  however,  local  in  nature  just  as  that 
of  equation  1.3.  The  desired  global  criterion  for 
simultaneous  stabilization  can,  however,  be  ob¬ 
tained  if  one  first  translates  the  fractional 
representation  theory  into  an  appropriate  geonetric 
setting. 

Indeed,  the  appropriate  geometric  setting 
proves  to  be  just  the  Grassmannian  first  Introduced 
into  the  system  theory  literature  by  Hermann  and 
Martin.  Unlike  their  frequency  domain  formula¬ 
tion,  however,  we  obtain  the  Grassmannian  directly 
from  the  ring  theoretic  fractional  representation 
previously  employed  by  the  authors.  Indeed,  the 
Grassmannian  Is  obtained  simply  by  factorinq  out 
the  non-uniqueness  inherent  in  the  fractional  repre¬ 
sentation  theory.  As  such,  in  addition  to  formulat¬ 
ing  the  global  theory  necessary  for  our  study  of 
the  simultaneous  stabilization  problem  the  geometric 
approach  yields  new  insight  into  the  relationship 
between  the  fractional  representation  theory 
(which  we  identify  with  the  elements  of  a  general 


linear  group)  and  the  system  itself  (which  we 

identify  with  the  elements  of  a  Grassmannian) . 

II.  REFERENCES 

1.  Desoer,  C.A.,  llu,  R.-w.,  Murray,  J.J.,  and  R. 
Seeks,  "Feedback  System  design:  The  Fractional 
Representation  Approach  to  Analysis  and  Syn¬ 
thesis",  IEEE  Trans,  on  Auto.  Cont.,  Vol.  AC-25, 
pp.  401-412,  (1980). 

2.  Hermann,  R.,  and  C.  Margin  "Applications  of  Al¬ 
gebraic  Geometry  to  linear  System  Theory".  IEEE 
Trans,  on  Auto.  Cont.,  Vol.  AC-22,  po.  19-25, 
(1977). 

3.  Herman,  P...  and  C.  Martin  “Applications  of  Al¬ 
gebraic  Geometry  to  System  Theory:  The  Mac¬ 
Millan  Degree  and  Kronecker  Indices  as  Topolog¬ 
ical  and  Holomorphic  Invariants",  SIAM  Jour,  on 
Cont.,  Vol.  16.  pp.  743-755,  (1978). 

4.  Saeks,  R.,  and  J.J.  Murray,  "Feedback  System 
Design:  The  Tracking  and  Disturbance  Rejection 
Problems",  IEEE  Trans,  on  Auto.  Cont.,  (to 
appear). 


Simultaneous  stabilization  of  a  1st  order  plant 
with  a  proportional  compensator. 


Abstract  of 


FEEDBACK  SYSTEM  DESIGN:  THE  SINGLE-VARIATE  CASE 


R.  Saeks,  J.  Hurray,  0.  Chua, 
C,  Karmokolias  and  A.  Iyer 


Abstract 

A  recently  developed  algebraic  approach  to  the  feedback  system  design 
problem  is  reviewed  via  the  derivation  of  the  theory  in  the  dingle-variate 
case.  This  allows  the  simple  algebraic  nature  of  the  theory  to  be  brought 
to  the  fore  while  simultaneously  minimizing  the  complexities  of  the 
presentation.  Rather  than  simply  giving  a  single  solution  to  the  prescribed 
design  problem  we  endeavor  to  give  a  complete  parameterization  of  the  set  of 
compensators  which  meet  specifications.  Although  this  might  at  first  seem 
to  complicate  our  theory  it,  in  fact,  opens  the  way  for  a  sequential  approach 
to  the  design  problem  in  which  one  parameterizes  the  set  of  compensators 
which  meet  one  specification  and  then  characterizes  the  subset  of  those  com¬ 
pensators  which  meet  the  second  spec.,  etc.  etc.  Specific  problems  investi¬ 
gated  include  feedback  system  stabilization,  the  tracking  and  disturbance 
rejection  problem,  robust  design,  transfer  function  design,  pole  placement 
simultaneous  stabilization,  and  stable  stabilization. 
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Abstract  of 


FRACTIONAL  REPRESENTATION,  ALGEBRAIC  GEOMETRY  AND  THE 
SIMULTANEOUS  STABILIZATION  PROBLEM 


R,  Saeks  and  J.  Murray 
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Abstract 


An  explicit  relationship  between  the  fractional  representation  approach 
feedback  system  design  and  the  algebro-geometric  approach  to  system  theory 
is  formulated  and  used  to  derive  a  global  solution  to  the  feedback  system 
problem.  These  techniques  are  then  applied  to  the  simultaneous  stabilization 
problem  yielding  a  natural  geometric  criterion  for  a  set  of  plants  to  be 
simultaneously  stabilized  by  a  single  compensator. 
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Abstract  of 


SIMULTANEOUS  DESIGN  OF  CONTROL  SYSTEMS 


R.  Saeks  and  J.  Murray 
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Abstract 


The  problem  of  designing  a  feedback  controller  which  stabilizes  a 
number  of  plants  simultaneously  is  discussed  from  the  fractional  represen¬ 
tation  point  of  view.  An  abstract  solution  of  this  general  simultaneous 
stabilization  problem  is  presented,  and  an  elementary,  explicit  criterion  is 
given  for  the  simultaneous  stabilizability  of  two  systems.  Finally,  some 
examples  and  counter  examples  are  presented,  and  some  open  problems  are 
discussed. 
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4.  Other  Funds:  Current  $52,090* 

5.  Total  Number  of  Professionals:  Pi's  2  (3  mo. )  RA's _ 

6.  Summary: 

Although  linearizations  have  long  been  employed  in  the  analysis  and  de¬ 
sign  of  nonlinear  control  systems  their  applicability  is  severely  limited 
by  the  approximate  nature  of  the  concept.  To  the  contrary,  if  one  could 
formulate  an  exact  tians ianmation  o a  nonlinear  system  into  a  lineal  system 
the  established  techniques  for  linear  system  analysis  and  design  could  be 
applied  to  the  nonlinear  problem.  The  goal  of  the  present  work  unit  is  the 
formulation  of  such  an  exact  linearization  theory  via  the  di^eiential 
geometric  techniques  previously  employed  by  the  senior  investigator  in  his 
investigation  of  the  contio liability,  observability,  and  stabilizability 
characteristics  of  a  nonlinear  system. 

Although  the  exact  linearization  problem  goes  back  to  Poincare  and  has 
been  studied  by  a  number  of  system  theorists  over  the  past  decade  with  the 
aid  of  a  generalized  class  of  transformations  introduced  by  Su  we  have  been 
able  to  formulate  readily  testable  necessary  and  sufficient  conditions  for 
the  solution  of  the  exact  linearization  problem.  Moreover,  when  these 
conditions  are  satisfied  the  required  transformation  is  given  by  the 

*NASA  Grant  in  support  of  Professor  Hunt's  leave  of  absence  at  NASA/ AMES 
during  the  1981/1982  academic  year. 
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solution  of  appropriate  partial  differential  equations. 

Although  one  might  expect  that  the  set  of  nonlinear  systems  which  ad¬ 
mit  an  exact  linearization  would  be  quite  thin;  and,  indeed,  this  is  the 
case;  in  practice,  we  have  found  that  many  "real  world"  systems  either 
satisfy  the  required  conditions  and/or  are  approximable  by  such  systems. 

Indeed,  the  exact  linearization  concept  has  been  successfully  implemented 
at  NASA/ AMES  on  several  autopilot  systems. 
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LINEAR  EQUIVALENTS  OP  NONLINEAR  TIME-VARYING  SYSTEMS 


L.  R.  Hunt*  and  Ren j eng  Su** 

Ames  Research  Center,  NASA 
Moffett  Field,  California. 


Abstract 

Recent  results  have  shown  that  a  single-input  time- invariant  system 
of  the  form 

x  ■  f  (x)  +  g(x)u 

can  locally  be  transformed  into  a  series  of  integrators  i^f  and  only 
if  (1)  the  vector  fields  g,  (adlf,g),  .  .  .,  (adn“2f,g)  are  involu- 
tive,  and  (2)  g,  (adlf,g),  .  .  (ada-1f,g)  are  linearly  indepen¬ 
dent  in  a  neighborhood  of  the  origin  in  IRn.  This  result  is  gener¬ 
alized  to  time-varying  systems.  A  parallel  theorem  is  obtained  in 
terms  of  the  time-varying  version  Lie  derivative  r. 


1 .  INTRODUCTION 

This  paper  is  concerned  with  the  problem 
of  equivalence  of  nonlinear  systems  and  a 
particular  linear  system,  that  is,  a  series 
of  integrators.  Expressed  in  state  space 
form  it  is 

xi  =  x2,  X2  =  Xi,  .  .  .,  xn-i  =  xn,  Xn  =  u, 

where  xi,x2/  .  .  .,xn  are  the  state  var¬ 
iables  and  u  the  control.  We  shall  call 
this  system  the  canonical  linear  system 
and  denote  it  by  I0. 

In  the  past  this  problem  has  been  studied 
by  Meyer  and  Cicolani ^ 1  *  and  by  Brockett  * 2  ^  . 
They  obtain  two  different  (but  intersect¬ 
ing)  classes  of  non linear  systems  which 
can  be  transformed  into  the  canonical 
linear  system  Z 0 . 


In  reference  3,  Su  points  out  the  differ¬ 
ence  between  the  equivalence  relations 
defined  by  Meyer  and  Brockett.  Considering 
the  class  of  nonlinear  systems  with  scalar 
input  of  the  form  x  *  f(x,u),  and  using 
the  equivalence  relation  defined  by  Meyer, 
necessary  and  sufficient  conditions  for  a 
nonlinear  system  to  be  equivalent  to  the 
system  £0  are  given.  This  is  the  largest 
equivalence  class  in  terms  of  state  coordi¬ 
nates  change  and  feedback,  and  it  properly 
contains  the  results  of  Meyer  and  Brockett. 

Later  in  reference  4,  the  authors  extended 
that  result  to  a  sufficient  theorem  on  the 
problem  of  global  equivalence.  In  another 
paper (submitted  to  this  conference) 
they  also  show  an  interesting  connection 
between  the  renowned  Poincare  lemma  and  the 
construction  of  the  desired  transformation. 


•Researcher  supported  by  Ames  Research  Center,  NASA,  under  the 
Intergovernmental  Personnel  Agreement  (IPA)  Program  and  the  Joint 
Services  Electronics  Program  at  Texas  Tech  University  under  Office 
of  Naval  Research  (ONR)  Contract  76-C-1136. 

♦•Research  Associate  of  National  Research  Council  at  Ames  Research 
Center,  NASA. 
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This  paper  is  devoted  to  generalizing  the 
previous  results  from  reference  3  to  the 
time-varying  case.  In  section  2  we  define- 
the  equivalence  relation;  the  main  results 
are  stated  in  section  3.  Because  the 
arguments  in  reference  3  can  be  applied 
here  with  only  slight  modification,  most 
of  the  proofs  of  the  theorems  are  omitted. 

2.  EQUIVALENCE  RELATION 

We  consider  the  scalar  input,  time- varying 
systems  on  2Rn  of  the  form 

E:  x  ■  f(x,u,t) 

where  the  origin  is  an  equilibrium  state 
for  any  time  t  when  u  =>  0 . 

A  transformation  is  a  map  T:  2Rn+2-»3Rn+1 
such  that  for  any  t  €  ir  the  restricted 
map  T(*,t):  3Rn+1  *  IRn+l  is  a  diffeo- 
morphism.  For  each  system  E:  x  *  f(x,u,t) 
the  combined  vector  (x,u)  of  the  state  and 
the  control  is  considered  as  an  element  of 
the  space  lRn+l .  With  the 
^■-transformations  we  can  define  a  concept 
of  system  equivalence. 

A  system  Ei?  x  ■  f(x,u,t)  is  said  to  be 
^-related  to  a  system  E2:  y  *  g(y,v,t)  if 
there  is  a  ^-transformation  T  that  car¬ 
ries  each  state  trajectory  of  Ei  into  a 
state  trajectory  of  E2;  that  is,  setting 
T(0(t;x0,u)  ,u(t)  ,t)  =*  { y ( t )  , v(t) )  ,  we  have 

(t)  =  g(y (t) ,v(t) ,t) , 


where  $(t;x,,u)  is  the  state  trajectory 
of  Ei  with  respect  to  the  initial  condi¬ 
tion  x,  and  control  u(t).  The  follow¬ 
ing  important  observation  of  this  relation 
is  parallel  to  that  in  reference  3. 

Proposition  Is  A  system  Ets  x  *  f(x,u,t) 
is  ./-related  to  a  system  Z2:  y  ■  g(y,v,t) 
if  and  only  if  there  is  a  transformation 
T  m  (Ti,  .  .  .,Tn,Tn+i)  such  that  T  has 
the  property 


3T 

TT 


0,  and  2*011  *  0 

aU 


and  satisfies  the  following  system  of  par¬ 
tial  differential  equations: 


n 

z 


3T.  3T . 

X  £  J.  X 

Ix~  f  j  Tt“ 


9i<Ti'  • 


i 


'Tn'Tn+i 't) 


1  #  •  •  •  /  n  / 


where  f  *  \flt  .  .  .  ,fn)  and 
g  ■  ^9: •  •  •  • »9n^ • 

Based  on  this  observation  one  can  prove 
that  the  ^-relation  id  indeed  an  equiva¬ 
lence  relation  among  the  systems.  There¬ 
fore,  we  are  justified  in  saying  that  two 
systems  are  y-equi valent .  In  the  next 
section  we  shall  characterize  the  equiva¬ 
lence  class  that  contains  the  canonical 
linear  system  E0. 

3 .  MAIN  RESULTS 

First  we  introduce  our  notations.  Since 
only  the  local  theory  is  attempted  here, 
everything  on  IRn  will  be  defined  locally 
near  the  origin. 

We  assume  the  reader  is  familiar  with  the 
basic  definitions-  of  vector  fields  and 
one-forms  on  lRn.  The  vector  fields  and 
the  one-forms  in  this  paper  may  be  varying 
in  time,  namely,  their  coefficients  may  be 
functions  of  time  when  they  are  expressed 
in  a  fixed  coordinate  frame. 

For  each  smooth  scalar  function  ;  on  IRn 
the  differential  operator  d  maps  ;  into 
a  one-form  d?  defined  by 
Oc/Sx^dx!  +  .  .  .  +  ( 3  c/3xn)  dxn.  For  a 
vector  field 

f  =  f ; (3/3X;)  +  .  .  .  +  fn(3/3xn)  and  a 
one-form  u  »  u,  dx, .  .  .  +  un  dxn,  the 
dual  product  of  u>  and  f,  denoted  by 
<  w,f  >,  is  a  scalar  function  defined  by 
w»fi  ■*••••+  cin*n • 

In  the  course  of  the  development,  several 
types  of  derivatives  with  respect  to  a 
vector  field  will  be  used.  We  state  the 
definitions  with  a  given  vector  field  f. 
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(1)  For  a  scalar  function  ?  on  ®n, 

L°f (?)  -  ?,  L‘f(;)  -  <d?,f>,  and 
Lnf  (?)  =*  LJf (Ln"‘f (?)  )  . 

(2)  For  a  one- form  w  on  3Rn, 
a?°f(ai)  *  u,  J?‘f  -  d<  (i»/f  >,  and 
2nf  (U)  -  2lt  U?n“lf  (01))  . 

For  a  time-varying  vector  field  g,  we 
have  two  types  of  derivatives: 

(3)  (ad° f ,g)  -  g, 

(adlf,g)  ■  (3f/3x)g  -  (3g/3x)f,  and 
(adnf ,g)  *  (adlf,  (adn-lf,g)). 

(4)  (r°f,g)  =*  g, 

(r‘f,g)  *  (ad*f,g)  -  3g/3t,  and 
(rnf,g)  -  (rlf,  (rn"lf ,g) ) . 

An  important  formula  involving  the  deriva¬ 
tives  of  types  (1),  (2),  and  (3)  is 

(2l  f  (iu)  ,g>  ■  <  w,  (adlf-,g)  >  +  L‘f  «  w,g>) .  (D 

Now  we  are  ready  to  study  the  problem  of 
characterizing  those  systems  that  are 
^■-equivalent  to  the  system  E ,  • 

If  a  system  E:  x  *  f(x,u,t)  is 
^"-equivalent  to  the  system  E0  with  a 
transfomation  T  =  (T,,T2,  .  .  .,Tn,Tn+l), 
then,  from  Proposition  1  we  have 

n  3T.  3T- 

f  j  (x'u't>  +  TT  U,t)  *  Ti+i  (x,t> 

i  *  1,  .  .  . ,  n  -  1  (2) 


f(x,u,t)  -  f(x,t)  +  g(x,t)  •  t!x,u,t) 

for  some  vector  fields  f  and  g  and  some 
scalar  function  <J>  with  f(0,t)  =  0, 
$(0»0,t)  *  0,  and  3$/3u  *  0  for  any  t. 

From  now  on  we  will  only  examine  systems 
of  the  form  x  ■  f(x,t)  +  g(x,t)u;  this 
will  not  result  in  any  loss  of  generality 
(see  ref.  3) .  A  system  will  also  be  rep¬ 
resented  by  a  pair  (f,g)  of  time-varying 
vector  fields. 


From  equations  (2)  and  (3),  a  system  (f,g) 
is  /-equivalent  to  the  system  Ec  if  and 
only  if  there  is  a  transformation 
T  =  (T,,  .  .  . , Tn , Tn+ , )  such  that  the  fol¬ 
lowing  equations  hold. 


rv 

3T 

E 

*7 

(fi 

(x, t)  +  g^  (x,  t) u) 

3Ti 

+  -55-  (x,t)  -  Ti  + 

for 

i  = 

1, 

.  .  . ,  n  -  1 ,  and 

n  3T 

E  33T1  (f  j  (5c,t)  +9j(x't)u) 


3Tn 

+  -55-  (x,t)  *  Tn+1(x,u,t).  (5) 

In  the  terminology  introduced  early  in  this 
section,  a  transformation  T  satisfies  the 
system  of  equations  (4)  and  (5)  if  and  only 
if  T  satisfies 


(  div , g  >  *  0 ,  i  =  l,  .  .  .,  n  -  1,  (6: 


n  3T  3T 

E  ^  fj(x,u,t)  +  (x,t)  «  Tn+l(*,u.t) 

3  (3) 

An  observation  similar  to  one  in  refer¬ 
ence  3  is  given  as  follows.  The  n-1 
equations  in  (2)  say  that  for  each  fixed 
pair  of  state  and  time  (x,t),  the  n  com¬ 
ponents  fj,  .  .  .,fn  of  f,  considered 
as  variables  of  u,  satisfy  n-1  linear 
equations  with  constant  coefficients. 

This  leads  to  the  following  theorem. 

Theorem  1:  If  a  system  x  =  f(x,u,t)  is 
•/-equivalent  to  2.,  then  f  can  be 
excr^ssed  in  the  form 


3T . 

<dTi'f>  +  TT  =  Ti+ 1 


i  -  1 ,  •  •  •  »  n  *  1 » 

.7! 

(  dT^, f  +  gu  )  + 

Sts 

II 

3 

+ 

(8) 

<  dTn,g  >  *  0 . 

(9' 

Considering  the 

case  i  *  2,  equations 

’.6) 

and  (7)  state  that 

<  dT2 ,g  >  *  Ot 

(10' 

<  dT; 

*  T 

<  f  >  +  a  Tj . 

(11! 

Letting  dT;  * 

i  and  substituting  ill: 

into  (13) ,  we  obtain 
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(12) 


(•S?1  f  (id)  ,g  >  +  (d  (-^-) , g^>  *  0 

By  formula  (1) 

(&‘f(u>)  ,g>  *  <  id,  (ad*f,g)  >  +  Llf  (<  w,g>) 
and  <iD,g>  =  0,  we  have 

(&l£(u),g>  ■  <  id,  (ad*f  ,g)  >.  Because  the 
operators  d  and  3/3t  commute,  we  have 
d ( 3Ti/3t)  »  3/3t(dTi)  -  3w/3t.  The  second 
term  in  equation  (12)  then  is  <  3b>/3t,g>, 
and  the  fact  that  <w,g>  •  0  implies 
(3<u,g>)/3t  *  0.  By  the  chain  rule. 


given  in  reference  3.  We  remark  that  the 
time-varying  version  of  the  Frobenius 
theorem  plays  a  crucial  role. 

Theorem  2:  A  time-varying  system 

E:  x  ■  f(x,t)  +  g (x,t) $ (x,u, t) 

(as  defined  in  Theorem  1)  is  ^-equivalent 
to  the  system  E0  if  and  only  if 

(1)  The  vector  fields  g, 

(T‘f ,g) ,  .  .  .,  (Tn“lf,g)  span 
]Rn  at  the  origin  at  any  time  t, 
and 


(2)  The  vector  fields  g, 

( T 1 f , g )  ,  .  .  .,  (rn-2f,g)  are 
involutive  near  the  origin  for 
any  t. 

4 .  CONCLUDING  REMARKS 


Summing  up  these  observations,  equation  (12) 
becomes 


<  w, (ad'f ,g)  > 


0. 


In  terms  of  the  operator  f ,  it  can  be 
expressed  as  <  w, (Flf,g)  >  *  0.  Similar 
computations  change  equations  (6)  and  (9) 
into,  respectively, 

'  'V . , ( rAf , g)  >  *  0, 

i  ■  0,1,  .  .  . ,  n  -  2  (13) 


and 


<  dT , , (7n“ 1 f ,g)  )  *  0.  (14) 


It  then  can  be  readily  checked  that  the 
existence  of  a  transfc rmation  T  satis¬ 
fying  equations  (6) -(9)  is  equivalent  to 
the  existence  of  a  scalar  function  Tj 
such  that  equations  (13)  and  (14)  hold. 
Except  for  the  use  of  the  time-varying 
version  of  the  Lie  derivative,  this  result 
is  exactly  parallel  to  that  obtained  in 
me  time-invariant  case  in  reference  3. 


Now  ve  are  ready  to  state  the  main  theorem 
which  characterizes  me  equivalence  class 
in  which  we  are  interested.  The  proof  is 
emitted  because  of  its  similarity  to  that 


The  conditions  in  Theorem  2  describe  a 
special  ^-equivalence  class  of  time- 
varying  systems.  For  a  nonlinear  system 
in  this  equivalence  class  we  can  solve  the 
system  of  partial  differential  equa¬ 
tions  (13)  and  (14)  for  a  transformation' 
that  will  turn  the  system  into  a  series  of 
integrators;  that  is,  the  system  E, .  The 
system  E0  is  not  only  linear  but  also 
time  invariant.  Notice  that  the  transfor¬ 
mation  is  varying  in  time  in  order  to 
''cancel"  the  time  dependency  of  the  sys¬ 
tem.  It  is  not  surprising  that  a  time- 
varying  version  of  Lie  derivative  T  has 
to  be  used  to  generalize  the  result  in 
reference  3.  This  operator  T  in  its 
linear  version  has  previously  appeared  in 
the  literature;  for  example,  see  Hermes.^6' 
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Abstract 

Recent  results  have  classified  those  nonlinear  systems  that  cam  be 
transformed  to  linear  systems.  For  single- input  systems  of  the  form 

x(t)  -  f  (x (t) )  +  u (t)  g  (x (t) ) 

we  assume  that  the  vector  fields  g,  [f,g],  .  .  .,  (adn_2f,g)  are 
involutive  and  that  g,  [f ,g] ,  .  .  .,  (adn-1f,g)  are  linearly  inde¬ 
pendent  in  a  neighborhood  of  the  origin  in  Hn.  It  is  shown  that 
the  transformation  mapping  this  system  to  a  linear  system  exists  by 
virtue  of  the  famous  Poincare  lemma  from  differential  geometry. 


1.  INTRODUCTION 

Because  of  the  extensive  literature  con¬ 
cerning  time- invariant  linear  systems,  it 
is  interesting  to  characterize  the  non¬ 
linear  systems  that  can  be  transformed  to 
these  linear  systems.  In  this  paper  we 
consider  single-input,  time-invariant  non¬ 
linear  systems  of  the  form 


f (x(t))  +  u (t) g (x  (t) ) 


where  f  and  g  are  *  complete  vector 
fields  on  an  open  set  in  ®n  containing 
the  origin  and  f ( 0 )  =0.  The 
formations  of  interest  to  us  are 

T  -  (Tx,  T2 . Tn+l),  which  map 

CXi,  .  .  .,  Xn,u)  space  to 


trans- 


If  we  think  of  Tn+I  as  being  the  control 
in  equation  (2) ,  this  system  is  in  inte¬ 
grator  form  with  T2,  T2,  .  .  . ,  Tn  being 
the  space  variables.  We  want  (i)  T  to 
have  a  nonsingular  Jacobian  matrix, 

(ii)  T (0)  =  0,  (iii)  Tx,  T2,  .  .  . ,  Tn  to 
be  functions  of  xl#  x2,  .  .  .,  xn  only 
and  to  have  a  nonsingular  Jacobian  matrix 
in  these  variables,  and  (iv)  Tn+1  to  be  a 
function  of  x, ,  .  .  .,  x„,  u  which  can  b« 


(T„  T2, 


space  so  that  the 


inverted  as  a  function  of 


system  (1)  is  mapped  to  the  linear  system  we  can  also  ask  where  the  T 

tion  is  a  dif feomorphism. 
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In  addition 
transforma- 


iriLCLui.tCr  £/*.  vifi 


Necessary  and  sufficient  conditions  for  the 
local  existence  of  such  a  transformation 
are  given  in  reference  1,  and  a  constructive 
proof  of  the  transformation  in  addition  to 
global  results  is  found  in  reference  2.  The 
purpose  of  this  paper  is  to  show  that  the 
existence  of  such  a  mapping  depends  on  the 
application  of  the  Poincare  lemma  from 
differential  geometry. 

Several  authors  have  examined  the  equiva¬ 
lence  of  nonlinear  systems  and  linear  sys¬ 
tems  under  various  assumptions.  Meyer  and 
Cicolani  considered  the  block- 

triangular  nonlinear  (possibly  time-varying) 
systems;  Krener^5'  gave  conditions  for- a 
nonlinear  system  to  be  transformed  to  a 
linear  system  under  state  space  coordinate 
changes ;  and  Brockett  *  * )  studied  the  equiv¬ 
alence  of  nonlinear  and  linear  systems 
under  coordinate  changes  and  additive 
feedback.  The  transformation  theory  devel¬ 
oped  in  references  1  and  2  contains  the 
results  from  the  authors  just  mentioned. 

In  section  2  of  this  paper  we  give  defini¬ 
tions  and  study  the  system  of  linear  par¬ 
tial  differential  equations  from  refer¬ 
ence  1  that  determines  the  existence  of  a 
transformation  of  the  type  that  is  of 
interest  to  us.  Section  3  contains 
examples,  the  construction  of  the  trans¬ 
formation,  and  several  applications  of  the 
Poincare  lemma . 

2.  DEFINITIONS  AND  TECHNIQUES 

We  give  basic  definitions  and  examine  the 
technique  in  reference  1  that  proves  the 
existence  of  the  transformation. 

lotting  X  and  Y  be  vector  fields  on  En 
(or  on  an  open  subset  of  Hn)  ,  we  define 
the  He  bracket  of  X  and  Y 

!*■*!  ■  Sx  -  H  *  • 

where  ;Y/cx  and  jX/rx  are  n  *  n  Jacobian 
matrices.  Successive  lie  brackets  like 


(X, [X,Y] ] ,  [Y, (X, YJ ] ,  etc.,  can  be  intro¬ 
duced  ,  and  we  set 

(ad°X , Y)  -  Y  , 

(adlX, Y)  -  [X, Y)  * 

(ad*X,Y)  -  (X, [X, YJ ]  , 


(ad^Y)  -  [X,  <«dk“*X,Y)J-  . 

A  set  of  <f  "  vector '(fields 

{Xx ,  .  .  IN  Xr)  on  ®n  is  involutive  if 

there  exist  functions  Y^^tx)  such 

that 

[X^Xj]  (x)  ■  Y^ jjj (x) Xj^ (x)  , 

:  1  S  i,j  £  r,  i  )*  j  . 

The  classical  Frobenius  theorem  states  that 
given  a  point  x„  £  »n  and  an  involutive 
set  (X^ ,  ,  Xr)  of  linearly  indepen¬ 

dent  vector  fields  on  Kn,  there  is  a 
unique  maximal  r  dimensional  (f*  sub¬ 
manifold  S  of  ®n  containing  x0  so 
that  the  tangent  space  to  S  at  each 
x  6  S  is  the  space  spanned  by 
Xj ,  .  .  . ,  Xr  at  x.  We  say  that  S  is 
the  integral  manifold  of  Xx ,  .  .  . ,  Xr 
through  x0. 

A  ^  °°  n  dimensional  manifold  M  is 
called  (smoothly)  contractible  to  a  point 
X.  s  M  if  there  is  a  ff"-  function 

H:  Mx  [0,1]  *  M 

such  that 

H  (x , 1)  *  x  for  x  s  M 
H (x,0)  ■  x,  . 

Of  course,  lRn  is  smoothly  contractible  to 
the  origin,  and  a  star-shaped  region  with 
respect  to  a  point  x.  is  contractible  tt 
that  point  (see  ref.  7) . 

Let  u>  be  a  k-form  on  M,  1  <  k  <  n  -  1, 
and  let  d  be  the  standard  differential 
operator  mapping  -  to  a  k  +  1  form  (m 
our  theory  j  is  a  cne-form) .  A  k-form 
w  is  called  closed  if  d*  *  0  and  exact 


»  o. 


i  *  1  /  •  • ,  n  *  *1  t 


if  u)  *  dn  for  some  k  -  1  form  n  on 
M.  This  leads  to  a  famous  result  of 
Poincare,  which  we  state  for  V°°  forms. 

Lemma  1:  If  M  is  smoothly  contractible 
to  a  point  x0  s  M,  then  every  closed  form 
u  on  M  is  exact. 

In  later  applications  of  this  result  M 
is  an  open  neighborhood  of  the  origin  in 
3Rn.  Recall  that  we  are  interested  in  a 


transformation  T  *  (Tx,  T2, 
which  takes  the  system 


*  *  ' 


f  +  ug 


to  the  system 


L  s*5  93 

3*1 

Zj  3X1  ? 


—  1  f 


iz;  (f j  4 


Solving  this  system  of  equations  is  shown 
to  be  equivalent  to  finding  a  func¬ 
tion  Tx  such  that 


<dTx,  (adKf ,g)>  -  0  , 
<dTx,  (adn“1f,g))  i*  0  , 


k»0,  1,  .  .  .,n-2 


so  that  the  conditions  (i)  through  (iv) 
(specified  in  the  Introduction)  hold.  By 
design,  this  mapping  will  be  a  diffeomor- 
phism  near  the  origin;  conditions  under 
which  we  have  a  global  diffeomorphism  are 
discussed  in  reference  2.  Necessary  and 
sufficient  conditions  for  the  local  exis¬ 
tence  of  such  a  map  are  that  (a)  the  matrix 
(g,  [f,g],  .  .  (adn_1f ,g| }  has  rank  n 
in  some  neighborhood  of  the  origin  in 
(xx,  Xj,  .  .  .,  xn)  space,  and 
(b)  the  set  of  vector  fields 

g,  (f,gl,  .  .  .,  (adn_Jf,g)  is  involutive 

in  some  neighborhood  of  the  origin  in  the 
same  space.  By  condition  (a)  above, 
g,  [f,g],  .  .  .,  (adn“lf,g)  are  linearly' 

independent,  and  the  Frobenius  theorem'' 
implies  the  existence  of  a  V*  n  -  1  \ 
dimensional  integral  manifold  of 
g,  lf,g],  .  •  (adrv"if,g),  using  condi¬ 

tion  (b)  above. 

From  reference  1  we  know  that  a  transfor¬ 
mation  T  »  (Tx,  Tx,  .  .  .,  Tn)  must  \ 
satisfy  the  system  of  partial  differential 
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where  {.,.)  denotes  the  duality  of  one- 
forms  and  vector  fields.  Thus  a  transfor¬ 
mation  satisfying  conditions  (i)  through 
(iv)  (specified  in  the  Introduction)  exists 
if  we  can  find  such  a  Tx  that  vanishes  at 
the  origin. 

3 .  GENERAL  RESULTS 

We  show  that  the  Poincare  lemma  can  be 
used  to  discover  all  of  the  functions 

Tx  satisfying  equations  (4)  under  the 
assumptions  that  the  matrix 
{9,  [f,g]/  .  •  (adn~lf ,g) }  has  rank  n 
and  the  set  g,  [f,g],  .  .  .,  (adn"*f,g) 
is  involutive.  The  transformations 
T  =  (Tx,  Tx,  .  .  .,  Tn+l)  with  condi¬ 
tions  (i)  through  (iv)  (from  the  Introduc¬ 
tion)  holding  can  be  found  by  applying 
equations  (3) . 

Example  1:  We  examine  the  nonlinear 


system 


x j  sin  x2  0  *  f (x)  + 

a  +  U  i 

xj  0  J  ^1_ 


ug(x) 


on  3R:.  Computing  the  first  Lie  bracket 
we  have 


and  g  and  [f,g]  are  linearly  independent 
on  {(xx,x2):  — tt / 2  <  x2  <  it/2}  -  U. 
Certainly  the  vector  field  g  ■  is  non¬ 
vanishing,  so  there  does  exist  a  trans¬ 
formation  (Tx ,T2 ,T j )  with  {xx,x2)  in  a 
neighborhood  of  the  origin  in  3R2  [<TX,T2) 

is  a  diffeomorphism  for  a  sufficiently 
small  neighborhood] . 

We  show  that  the  Poincare  lemma  implies 
the  existence  of  a  transformation  with 
(xx,x2)  in  CJ.  We  need  to  demonstrate 
the  existence  of  a  ff“  function  Tx 
satisfying  equations  (4).  Consider  the 
one-form  a>  *  1  dxx  +  0  d*2  on  0.  Since 
it  is  closed  and  U  is  contractible  to 
the  origin,  there  is  a  if"  function  Tx 
satisfying  dTx  »  oj  by  Poincare.  Now 
<dTx,g)  *  0,  and  since  g  and  [f,g]  are 
linearly  independent  on  U  and  dTx  #  0, 
<dTx,[f,g]>  #  0.  The  transformation  T 
exists  for  (xx,x2)eU,  and  taking  Tx 
with  Tx (0)  «  0  conditions  i)  through  iv) 
hold  as  desired.  Such  a  transformation  is 
(Tx,T2,T3)  =  (xx,sin  x2,(cos  x2)u),  which 
is  one-to-one  on  U. 

Example  2:  Consider  the  system  oii  3R2 


X. 

ff, (xT 

”gx  <*> 

*  I  * 

+  u 

X, 

— 

l_f  2  (X) 

_g2  (x) 

Assume  that  there  exists  a  smoothly  con¬ 
tractible  neighborhood  U  of  the  origin 
in  (x, ,x2)  space  on  which 

1.  g  is  nonzero  and 
•c./'-x^  »  -(jgj/'X^ 

2.  g  and  [f,g]  are  linearly 
independent. 


T 


Example  3 :  Suppose  we  have 

x. 


fx(xf|  ("g.  (x) 
+ 

_f  2  (x)J  |_g2(x)_ 


f  +  ug 


on-  »2.  Assume  there  exists  a  smoothly 
contractible  neighborhood  0  of  the  origin 
in  (xx,x2)  space  on  which 


1.  g  is  nonzero 

2.  g  and  [f,g]  are  linearly  independent 

3.  [ [f ,g] ,g]  *  ag  for  some  V  “ 
function  a. 


Assumption  (3)  above  is  due  to  Brockett^6* 
and  is  one  of  his  conditions  for  a  trans¬ 
formation  (using  coordinate  changes  and 
additive  feedback)  to  a  linear  system.  In 
reference  1  it  is  shown  that  a  transforma¬ 
tion  of  the  type  discussed  in  this  paper 
exists  <n  a  neighborhood  of  the  origin 
under  assumptions  (1)  and  (2)  only.  It  is 
interesting  to  note  that  Example  1  does 
not  satisfy  assumption  (3) . 

We  take  the  one- form 

gi  , 

“  *  det  C  dXl  ~  det  C  dx*  ' 


where  C  is  the  2*2  matrix  with  columns 
g  and  (f ,g] ,  which  is  nonsingular  on  U. 
This  form  is  closed  if  and  only  if 


J  g*  \  3  ( 

gi 

\ 

3  \  det  C)  3  \ 

,  det  C 

) 

3  X  j 

3x, 

Letting 

[f,g]  -  h,  [h,g] 

■  ag 

gives 

3g!  h 
3x;  hi 

+  3gi  h  5hx 

3Xj  h:  *  3x,  gi 

3  h 

_ i 

3x, 

gx  m  ogx 

and 

3g,  3h, 

3h2 

ng”. 

+  3x2  h2  ~  3xx  g: 

"  3x2 

g.  *  io. 

*  *•  "  »■ 

Substituting  these  last  two  equations  into 


The  cne-form  u,  ■  g.  dx2  -  g,  dx2  is  closed 
on  tl  (d_  >=  (:-g./?x.  +  3g./3x2)dx2  A  dx1)), 
so  there  exists  a  *  function  Tx  [we 
take  T: (0)  »  0]  so  that  dTx  «  ».  Again 
iT  , g  *  0  and  dTx,[f,g]>  #  0  by 


we  get  0;  that  is,  assumption  (3) 
that  the  one-form  is  closed  on 
the  Poincare  lemma  there  exists  a 


implies 
f.  3y 


74 


V  ”  function  Tx  on  0  with  dT2  *  w.’ 
The  facts  that  <dT2,  w)  *  0  and 
< dT2 ,  [f,g])  #  0  follow  easily. 

We  consider  the  general  case  for  equa¬ 
tions  (1)  and  suppose  that  the  rank  assump¬ 
tion  on  the  matrix 

(g ,  [f,g],  ...»  (adn"1f,g)}  and  the 
involutive  assumption  on 
g,  [ f  ,g]  ,  .  •  (adn"*f,g)  hold  on  some 
open  neighborhood  of  the  origin  in 
(x1#  Xj ,  .  .  . ,  xn)  space.  We  seek  a 
solution  T2  of  equations  (4).  It  is 
important  to  remember  that  first-order 
linear  partial  differential  equations  are 
solved  by  reducing  them  to  systems  of 
ordinary  differential  equations.  At  each 
stage  of  our  constructive  procedure  we 
give  an  equation  for  Tx  to  satisfy,  but 
we  wait  until  a  construction  is  completed 
and  let  the  Poincare  lemma  give  a  solu¬ 
tion  Tj  to  all  of  these  equations. 

Since  f  and  g  are  complete  and 

g.  [f.gl,  •  •  •«  (adn“1f ,gj  are  linearly 

independent,  we  know  that 

[f,g],  .  .  .,  (adn-1f,g)  are  complete. 

For  all  s  €  K  we  solve  the  system 
-  (adn_1f ,g) 


with  initial  conditions  x { 0 )  *  0  to 
obtain  the  unique  integral  curve  x(s)  of 
(adn-1f,g)  through  the  origin  in 
(x,,  Xj,  .  .  .,  xn)  space.  The  partial 


differential  equation 
solved  by  considering 
the  system 


We  denote  by  x(s,tx) 
the  first  system  dx/ 
conditions  x(s,0)  * 


<dTlf  g)  =  0  is 
for  all  tj  £  S 


the  solution  of 
t:  *  g  with  initial 
(s)  . 


We  then  examine  for  all 


:  ,g! 


t.  €  the  system 


In  this  way  we  deal  with  the  partial  dif¬ 
ferential  equation  <dT2,  [f,g]>  =0.  The 
solution  of  dx/dt2  =  [f,g]  with  initial 
conditions  x(s,t1,0)  *  x(s,t1)  is 
x(s,tl,tl).  Continuing  this  process,  the 

last  step  involves  the  system 

* 


dx 


dt, 


n-i 


(adn-Jf ,g) 


dT  j. 


dt 


n-i 


which  is  associated  with  the  partial  dif¬ 
ferential  equation  (At2,  (adn  -*f,g)>  -  0. 
By  x(s,  tt,  .  .  .,  tn_2)  we  denote  the 
integral  curves  of  the  vector  field 
(adn_2f,g)  with  s,  t2 ,  .  .  .,  tn_2 
behaving  like  parameters. 


Since 

g,  [f 

t  1  t  *  • 

.,  (adn-1f,g] 

linearly  independent, 

the  matrix 

*  5xt 

3*i 

3xi 

TF 

3ti  ' 

’  '  3tn-i 

3x2 

3x2 

3x2 

3s 

~  • 

'  '  3  ^n— i 

3xn 

3xn 

• 

3xn 

3s 

3ti 

’  *  3tn-x  J 

called  the  noncharacteristic  matrix,  is 
nonsingular.  Thus  s,  t2,  .  .  .,  tn_. 
can  be  solved  for  x  ,  xjf  .  .  .,  xn  in  a 
neighborhood  of  the  origin.  We  denote  the 
map 

(s , t j ,  •  •  . , )  —  (x, (s , t2 ,  .  .  • , f )  » 

X.(s,tj,  •  .  .  ,  t  j}..  2  )  ,  .  .  ., 

xn  *s ' '  •  "  ■ ' fcn-i ) ) 

by  F  and  note  that  its  Jacobian  matrix  is 
the  noncharacteristic  matrix. 

We  are  now  ready  to  prove  our  result  con¬ 
cerning  mappings  of  nonlinear  systems  to 
linear  systems. 

Theorem  1 :  Let  U  be  an  open  subset  in 
(x, ,  Xj ,  .  .  . ,  xn)  space  containing  the 

cngin  and  suppose  that 
(a)  a,  (f ,g]  ,  .  .  .,  (adn-lf  ,gl  are 
linearly  independent  cn  U,  and 
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(b)  g,  [f,g],  ...»  (adn_2f,g)  are 
involutive  on  U.  Also  assume  the  mapping 
F  is  one-to-one  and  that  the  noncharac¬ 
teristic  matrix  is  nonsingular  on  an  open 
subset  V,  which  is  smoothly  contractible 
to  the  origin  in  (s,  tx,  .  .  .,  tj^) 
space  and  with  U  c  F(V)'.  Then  there 
exists  a  transformation 

T  -  (Tx,  Tt,  .  .  Tn+1):U  ♦  Rn+1  taking 

system  (1)  to  system  (2)  so  that  (i)  T  has 
a  nonsingular  Jacobian  matrix  xith  respect 
to  (xx,  .  .  .,  xn,u) ,  (ii)  T (0 )  *  0, 

(iii)  Tx,  Tz,  .  .  . ,  Tn  are  independent 
of  u  and  have  a  nonsingular  n  *  n 
Jacobian  matrix,  and  (iv)  Tn+1  is  a  func¬ 
tion  of  (xx,  .  .  xn,u)  which  can  be 
inverted  as  a  function  of  u. 


<3>  (adn_2f,g)  as 

tj ,  tj,  .  .  .  ,  tn_j  vary.  Now,  T3  is 
constant  on  each  integral  manifold  and 
must  solve  <dT1,g>  *  0, 

<dT1,(f,g)‘)  =  0,  .  .  .,  <dTx ,  (adn"2f  ,g) )  0 

as  functions  of  s,  t1#  .  .  tn_x.  Our 
assumption^  on  F  and  the  noncharacteris¬ 
tic  matrix  imply  that  we  can  solve  for 
(s,  tx,  .  .  . ,  tn-i)  as  functions  of 
(xx,  Xj,  .  .  xn)  and  hence  the  desired 
partial  differential  Equations  are  solved 
on  U. 


Since  g,  [f ,g] ,  .  .  .,  (adn-2f,g)  are 
linearly  independent  on  0  and 


3T- 


dT, 


_ i  _3_s 

3s  3x 


3T,  3s 

,  * -sr  ^ 


Proof:  Recall  that  we  need  to  find  a  V 
function  Tx  that  vanishes  at  the  origin 
in  (xx,  x2,  .  .  .,  xn)  space  and  satis¬ 
fies  equations  (4) .  We  have  constructed 
the  function  F(s,  tx ,  .  .  .,  tn-1)  whose 
Jacobian  matrix  is  the  noncharacteristic 
matrix. 

Let  4>(s)  by  any  function  of  s 

which  does  not  vanish  on  V  (think  of  41 
as  defined  on  V  because  of  its  indepen¬ 
dence  in  (tx,  t2,  .  .  .,  tn_x)).  Con¬ 
sider  the  one- form 

j  =  <p(s)ds  +  0  dtx  +  .  .  .  +  0  dtn-x  on 
V.  Since  it  is  obviously  closed  on  V 
there  exists  a  V  °°  function  Tx  (s) 
satisfying  dTj  *  w  such  that  Tx (0)  *  0. 


Certainly 


ct 


T,  3T. 

*  =  0, 


3T, 


3t. 


0, 


3t_ 


-2  -n-i 

on  V,  but  how  does  this  relate  to  the 
equations  <dT;,g>  *  0, 

'dr.,[f,g]>  -  0,  .  .  . 'dTx , (adn-2f ,g)> «  0? 


Since  g,  [f,g],  .  .  (adn_2f,g)  are 
involutive  on  U,  the  Frobenius  theorem 
and  our  construction  of  the  map  F  imply 
that  for  each  fixed  s  we  have  an  n  -  1 
dimensional  integral  manifold  of 


3Ti  3s 

+  3  s  3xn  dxn 


$ (s)ds 


"V 

"sin  x2" 

"0" 

*2 

- 

sin  x3 

+  u 

0 

.*3. 

_  0 

_1_ 

is  nonzero  on  U,  the  equation 

<dT2 , (adn_1f ,g) )  ^  0  holds  on  the  set  U. 

We  conclude  with  the  following  example. 

Example  4:  We  take  the  nonlinear  system 


f (x(t) )  +  ug  (x  (t) ) 


on  ]R3.  Computing  Lie  brackets  we  have 

"O'" 

[f,g]  =  -  COS  Xji 

_  0  J 

"cos  Xj  cos  x3 
(ad2f,g>  -  0 

0 

0 

[g ,  [ f  , g ] ]  *  sin  x3 

_  0  J 

Thus  conditions  (a)  and  (b)  of  Theorem  1 
are  fulfilled  on  the  open  set 


U 


(x3  ,x2  ,x3 )  €  JR3  :  -  j  <  x. 


<  2 


— 1  <  V  <  —  l 

2  X-’  2  I  • 


76 


The  solution  of 


dx 

ds 


(ad2f,g)  , 


x  (0)  =  0 


is  x^s)  *  s,  x2(s)  =  0,  and  x3  (s)  *  0. 
The  system 


g 


“Roger  Brockett  presented-  results  concern¬ 
ing  mappings  of  nonlinear  systems  to. 
linear  systems  and  provided  a  construction 
similar  to  that  preceding  Theorem  1  at  a 
1978  CBMS  conference  held  at  the  University 
of  California,  Davis. 
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TRANSFORMING  NONLINEAR  SYSTEMS* 


L.  R.  Hunt  And  Renjeng  Sut 


ABSTRACT 

We  consider  nonlinear  systems  of  the  form 

m 

•  x(t)  =  f(x(t))  +  J  u. (t)g. (x(t)) 

i=l  1  1 

co  n 

where  f,  g. ,  g  are  C  complete  vector  fields  on  1R  and  f(0)*0. 

Because  of  the  amount  of  literature  devoted  to^the  study  of  linear  time 
invariant  systems,  it  is  reasonable  to  aslc  necessary  and  sufficient  con¬ 
ditions  for  the  above  system  to  be  transformable  to  a  linear  system. 

Using  such  a  transformation  we  could  construct  a  regulator  for  the  non¬ 
linear  system  by  building  ore  for  the  linear  system  (G.  Meyer  has  done 
this  in  his  study  of  automatic  fligh  .  con croJ .  It  is  the  purpose  of 
this  paper  to  find  conditions  (depending  on  Lie  brackets)  for  a  trans¬ 
formation  to  exist.  Basically,  we  choose  a  canonical  form  for  a  linear 
time  invariant  system  and  investigate  the  possible  mapping  of  our  non¬ 
linear  system  to  that  canonical  form. 

I.  Introduction 

Suppose  we  have  a  nonlinear  plant  that  we  are  to'  control  to  perform 
some  task.  For  example,  an  aircraft  which  is  designed  to  automatically  fly 


*Research  supported  by  NASA  Ames  Research  Center  under  the  IPA  Program  and 
the  Joint  Services  Electronics  Program  at  Texas  Tech  University  under  ONR 
Contract  N00014-76-C-1126. 

tResearch  Associate  of  National  Research  Council  at  Ames  Research  Center. 
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A 

*_'• 

a  designated  path  despite  modelling  errors  and  disturbances  (see  [1] ,  [2] , 
[3]).  We  consider  the  mathematics  associated  with  such  a  problem. 

Assume  the  dynamics  of  our  plant  are  described  by  the  system 

ar 

(1)  x(t)  -  h(x,t,u) , 

where  x  c  IRn  and  h  is  a  complete  C*  vector  field.  Our  research  involves 
multi- input  and  time  varying  systems,  but  to  save  notation  we  emphasize 
the  single  input  and  time  invariant  system 

(2)  x(t)  -  h(pc,u) , 

and  mention  the  more  general  results. 

Since  the  design  theory  for  controllable  linear  time  invariant  systems 
is  treated  in  the  literature,  if  our  nonlinear  system  is  equivalent  (say 
using  coordinate  changes  and  feedback)  to  a  controllable  linear  system,  then 
wc  can  use  this  fact  in  our  control  problem. 
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Thus  we  are  interested  in'  characterizing  those  nonlinear  systems  which 

are  transformable  to  controllable  linear  systems.  The  transformations  we 

consider  are  maps  T  *  (T. ,T„, • •• ,T  )  which  take  V  *  1R  (with  variables 

1  £  n+1 

(x, ,x„,***»x  ,u)),  where  V  is  an  open  neighborhood  of  the  origin  in  state 
1  2  n 

space,  onto  an  open  set  in  3Rn+^  ((T^,T2,*  *  *»Tn+^  space)  containing  the 
origin,  so  that  the  following  properties  hold: 


i)  T(0)  *  0 

ii)  T, * !„,•••, T  are  functions  of  x,  >x.,»***»x  only  and  have  a  nonsin- 
1  £  n  l  £  n 

•  gular  Jacobian  matrix  on  V 

3T  +1 

iii)  Tn+1  is  a  function  of  xjL»X2' ‘* * »xn,u  for  which  3u-'  is  nonzero  on 
V  x  IR 

iv)  T1,T2,***,Tn+1  satisfy 


—  " 

—  " 

“*  - 

T  *  T 

L1  2 

fl 

T2 

0 

T  ’  "=»  T 

• 

T 

T, 

0 

2  3 

2 

3 

• 

• 

, 

• 

(3)  .  or 

• 

- 

• 

+  T 

n+1 

• 

• 

• 

• 

• 

i  »i  „ 

• 

T 

0 

1 

n  n+1 

n 

_  _ 

that  is,  "  *  *  ,Tn  are  state  var*ables  and  T^+^  the  control  for  our 

controllable  linear  system 


v)  T-(TlfT2, 


,T  .. )  is  one-to-one  on  V  n  E. 
n+1 
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Results  in  this  paper  are  local  (near  the  origin  in  state  space) , 
but  global  theorems  have  been  proved  in  [4] . 

It  is  shown  in  [5]  that  if  the  system  (2)  is  transformable  to  the 
system'  (3) ,  then  it  can  be  "reduced"  to 

(4)  i(t)  *  f(x(t))  +  u(t)g(x(t)), 

» 

where  f  and  g  are  C  vector  fields  on  V,  and  we  assume  f(0)  =  0.  Hence  we 
wish  to  map  system  (4)  to  system  (3). 

For  related  results  concerning  the  transformations  from  nonlinear 
systems  to  linear  systems  we  refer  to  the  research  of  Krener  [6] ,  Brockett 
[7] ,  Meyer  [3] ,  Jakubczyk  and  Respondek  [8] ,  and  the  authors  [4] ,  [5] ,  [9] , 
[10] , [11] . 

In  section  H  we  give  basic  definitions  and  the  partial  differential 
equations  we  must  solve  to  build  a  transformation.  Section  III  contains 
the  main  result,  a  constructive  proof  of  a  transformation,  and  comments 
about  the  more  general  theory. 
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*  ■ 


n.  Definitions  and  Preliminaries 


»  n 

If  f  and  g  are  C  vector  fields  on  an  open  set  in  H  ,  we  define  the 
Lie  bracket  of  f  and  g 


[f  gi  ,  Me 

l£,8J  9xf  3x 8  * 


where  -r*  and  -r—  are  n  x  n  Jacobian  matrices.  We  let 
.  3x  3x 

(ad°f,g)  -  g 

(adLf ,g)  -  [f ,g] 

(ad2f,g)  -  [f,[f,gl] 

«  • 

(adkf,g)  -  [f»(adk”1f ,g)] . 

00 

For  h  a  C  function  on  V  we  define  the  Lie  derivative  of  h  with 
respect  to  f  as 

Lf(h)  -  <dh,f>, 

with  dh  being  the  gradient  and  <•»•>  being  the  duality  between  one  forms 

00 

and  vector  fields.  If  w  is  a  C  one  form,  then  we  have  the  Lie  derivative 
of  w  with  respect  to  f 

V»).  (gif )*♦*!.  . 
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where  *  denotes  transpose  and  and  are  Jacobian  matrices. 


A  relation  between  the  three  types  of  Lie  derivatives  just  defined  is 


C5)  Lf<w,g>  -  <Lf(w),g>  +  <w, [f,g]>, 

\ 

CD 

where  g  is  a  C  vector  field  and  f  and  w  are  as  before. 

In  [5]  it  is  proved  we  can  transform  system  (4)  to  system  (3)  by 

T  *  (Ti’T2’“*‘Tq+i)  if  and  only  if  Tl,T2*“*’Tn+l  haVS  linearly  ^depen¬ 
dent  gradients  and  satisfy 


<dT1,g>  -  0,  i*l,2, • • • ,n-l 


(6) 


<dT1,f>  -  Lf(Tt)  -  t1+1,  — 


<dTa,f+iig>  -  If+ug(V  -  In+1  . 


Using  the  formula  (5)  repeatedly  these  equations  become 


(7) 


<dT1, (adkf,g)>  -  0,  k*0,l,. . . ,n-2 

(-l)tl“1u<dT,  ,(adri_1f,g)>  +  <dT  ,f>  •  X  ... 

x  n  n*ri 


Hence  we  have  a  desired  transformation  T  if  and  only  if  we  can  find 
a  solution  Tj  of 


(8) 


<dT1,(ad  f,g)>  -  0,  k*0 ,1, • • * ,n-2 


<dT1,(adn"1ffg)>  #  0 


which  vanishes  at  the  origin.  The  remaining  coordinate  functions 
T-,T_, are  easily  derived  from  (6). 


III.  Construction  of  a  Transformation 


oo  n 

A  collection  of  C  vector  fields  a^x)  ,a2  (x)  , . . .  ,ar  (x)  of  R  is 
called  involutive  if  functions  exist  so  that 


Cai » a^ ] (x)  -  l  Yijk(x)ak(x) ,  l<i, j<r,i#j . 

Our  main  result  depends  on  the  assumptions 

« 

a)  the  set  {g, [f ,g] , • • • , (adn  if,g)}  spans  an  n  dimensional  space,  and 


n-2. 


b)  the  set  (g, [f ,g] »••• , (ad  f,g)}  is  involutive. 


Theorem  3.1  Conditions  a)  and  b)  hold  for  (x^,^,  *  *  *  »xq)  in  some  neighbor¬ 
hood  of  the  origin  in  1R n  if  and  only  if  there  is  an  open  set  V  C  ]Rn 
containing  the  origin  and  a  transformation  T  »  ^Ti,T2’***,Tn+l^  •'  v  x  ®  -*-R1 
such  that  properties  i)  through  v)  hold. 

This  theorem  is  proved  in  [5]  ,  and  we  illustrate  a  method  for  con¬ 
structing  such  a  transformation.  For  a  real  parameter  e  R  we  solve 


n+1 


d*(sl>  ,  .n-1,  . 

-jr - f.s) 


with  initial  conditions  x(0)  *  0.  Then  we  consider  for  s^  e  1  the  system 

dlt(sl,s2)  .  ,n-I.  .  , 
j—  ■  (ad  £.8  ) 

satisfying  xCs^.O)  »  xCs^.  Repeating  the  process  n-2  more  times  we  arrive 
at  the  final  system 

dx(s. , s. , . . .,s  ) 

- L_2 - a.  , 

ds_ 
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with  initial  conditions  x(s, ,s,, • • • ,s  , ,0)  *  x(s.  ,s,,***,s  .).  We  have 

x  i  n-i  .  l  z  n-l 

a  map 

(s1,»2,-*,sil)  -  (»1(s1,a2,--,sn),  x2(81,a2,‘“,sa),***,xn(s1,s2,-«,sii)) 

'i 

with  a  Jacobian  matrix  at  (0,0, •••,0)  equal  to  the  matrix  with  columns 

(adn  1f,g),(adn  ^f,g) ,*•*, [f,g] ,g.  Since  this  last  matrix  is  nonsingular 

at  the  origin  by  condition  a)  we  can  solve  for  (s, , $-,••♦,  s  )  as. functions 

12  n 

of  (x. ,x0, • • «vx  )  in  an  open  neighborhood  of  the  origin  using  the  inverse 
x  x  n 

function  theorem. 

If  we  can  find  a  function  which  solves  (8)  as  a  function  of 
s^»s2, ••• ,sq,  then  we  know  as  a  function  of  x^,x2> ,xq.  Because 
g, (f,gl ,*•*, (adn  2f,g)  are  involutive,  as  we  fix  s.  and  let  s0,s,,»»»,s 
vary  we  get  an  integral  manifold  of  this  involutive  set  by  the  famous 
frebenius  Theorem.  Hence  if  we  choose  ■  s^,  then  we  have  a  solution  of 
(8).  As  was  mentioned  earlier  *”  *Tn+i  are  ^ound  by  differentiation. 

An  illustration  of  this  technique  is  given  in  the  following  example. 

Example  3.2  Consider  the  nonlinear  system 


sin*2 

0 

x2 

m 

10 

x3“xl 

+  u 

0 

■  f(x(t))  +  u(t)g(x(t;)) 

A 

0 

1 

for  (x^.x^Xj)  e  V  with 

V  -  {(x^.x^Xj)  :  - y  <  x2  <  j}. 


/ 


Computing  we  find 


1 

and  conditions  a)  and  b)  of  Theorem  3.1  hold  on  V.  Solving 


It  is  interesting  to  remark  how  these  transformations  for  (4)  are 
used  in  practice.  We  want  to  choose  the  control  u  to  drive  the  plant. 

If  all  state  variables  are  available  to  us  (estimates  can  be-used  if  they 
are  not),'  we  nap  to  the  linear  system  and  choose  our  control  Tn+^.  Then  to 
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find  u  we  just  have  to  solve  the  equation 

(“l)n_1u  <  dT1,(adn“1f,g)  >  +  <  dTQ,f  >  -  Tn+1 
which 'is  linear  in  u. 

If.  we  consider  the  multi-input  system  ^ 

(9)  i(t)  -  f(x(t))  +  £  u  (t)g  (x(t)) 

i-1  1 

then  for  our  target  controllable  linear  system  we  choose  a  Brunovsky  [12] 

canonical  form  associated  with  a  set  of  Kronecker  indices  ic  ,ic  , • •• ,k. . 

1  2  m 

A  transformation  T  *  (T^.T^,  *  *  *  »Talm)  exists  if  and  only  if  the  following 
conditions  are  satisfied  for  all  (x^,x^,»  •  •  ,x^)  near  the  origin.  The  set 

C  “  (gL ,  [f ,  gLl ,  •  •  • ,  (adC1_1f ,  gl)  ,  g2 ,  [f ,  g2]  ,  •  •  • ,  (ad^^f , g2)  ,  •  •  • , 

spans  an  n  dimensional  space,  the  sets  Cj  He  with 

Cj  *  ^Sl’  ^.8^  »**• »  (adKr2f  ,gi)  ,g2,  [f,g2]  .•••,(ad<j~2f,g2)  ,•••, 

VIf’8m1**,*’(ad,Cr2f’8m)} 

are  involutive,  and  the  span  of  each  Cj  equals  the  span  of  C  j  H  C  for 
j  ■  l,2,***,m.  This  result  is  proved  in  [9],  and  a  discussion  [11]  of  this 
topic  was  presented  at  the  recent  JACC  meeting. 

For  time  varying  nonlinear  systems  one  must  replace  the  Lie  bracket 
[*,♦]  with  a  time  varying  Lie  derivative  (see  [10],  [13],  and  [14]),  The 
tine  variable  t  appears  as  a  parameter  in  constructing  the  transformation, 
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and  we  nap  Co  a  controllable  linear  tine  invariant  systen. 

In  system  (4)  we  assume  that  f(0)  *  0.  Current  research  is  in  pro- 

•  I 

gress  for  which  the  origin  may  not  be  an  equilibrium  point  of  the  vector 


field  f. 


\ 

The  authors  wish  to  thank  George  Meyer  for  valuable  conversations. 
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Global  Mappings  of  Nonlinear  Systems 
L.  R.  Hunt*  and  Renjeng  Su 
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If  f  and  g  are  complete  vector  fields  on  !Rn  we  examine  the  nonlinear 

system 

x(t)  *  f(x(t))  +  u(t)g(x(t))  . 

We  find  sufficient  conditions  for  the  existence  of  a  global  transformation 
defined  for  all  (x^xj,  .  .  x^eR*1  which  takes  our  nonlinear  system  to  a 
linear  system.  A  constructive  proof  is  given  of  the  transformation  by  solving 
a  system  of  partial  differential  equations.  We  require  that  the  vector  fields 
g,[f,g],  •  •  •»  (adn  1f,g)  span  an  n-dimensional  space  at  each  point  of  Rn, 
that  the  set  {g,[f,g],  ...»  (adn  2f,g)}  is  involutive  on  Bn,  and  that  the 
noncharacteristic  matrix  defined  in  our  construction  of  the  transformation 
satisfies  the  assumptions  of  various  global  inverse  function  theorems. 


*0n  le&w  from  Department  cf  Mathematics ,  T<s.\a-»  rech  Culver.-,  .ty,  Luilaok, 
lent:- ,  7 
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i  riL.  -Lwi*.  J  bW*-H0f 


1 


I 


NOMENCLATURE 

Infinitely  differentiable  functions 
B°  n  dimensional  real  Euclidean  space 
<i£  controls 
X  tine  derivative  of  x 
X.  INTRODUCTION 

Suppose  we  have  che  nonlinear  system 
a 

i(t)  -  f(x,t)  ♦  £  u. (t)g. (x.t)  ,  (1) 

1-1  1 

where  f,  g, .  .  .  gg  are  complete  (no  finite 
escape  time)  #“  vector  fields  on  Kn  and 
f(0,t)  -  0  for  each  C.  Recent  results  have  con¬ 
tained  necessary  and  sufficient  conditions  for  there 
to  be  a  local  transformation  of  system  (I)  to  a  con¬ 
trollable  time  Invariant  linear  system.  We  combine 
these  results  with  certain  versions  of  the  global 
Inverse  function  theorem  to  yield  conditions  under 
which  a  global  transformation  exists.  To  simplify 
notation  in  this  paper  we  restrict  co  the  single 
Input  time  Invariant  (autonomous)  system 

x(t)  -  f(x(t)]  +  u(c)g[x<t>J  .  (2) 

but  cur  theory  will  generalize  to  the  system  (1). 


In  this  single  input  case  we  map  to  the  linear 
system  In  Integrator  form 


•  ■ 

*  - 

■  * 

y\ 

yt 

0 

h 

a 

y . 

+  V 

0 

Vi 

V 

0 

> , 

0 

1 

*  « 

(3) 


If  we  were  considering  the  general  system  (1)  then  we 
attempt  to  map  co  a  linear,  system  In  its  Brunovsky  i'J) 
canonical  form  associated  with  che  Kronecker  Indices. 
We  Indicate  the  mappings  of  Interest  Co  us.  A 


Vm  transformation  T  -  (Tj ,  T2 ,  .  .  . ,  T^)  takes 

((Xj.x, . x^u)  »P«ce)  to 

K1*-1  ((Tj.T, . Tn,Tn+l)  -  (yj.y, . y„.v)  spcce) 


so  that  T  has  the  following  properties: 

I)  T(0)  -  0  , 

II)  Tj.Tj . Ta  are  functions  of 

xj,x: . x„  only  and  have  a  ncnslngular  Jacobian 

matrix  on  Rn, 

III)  Tn+1  is  s  function  of  x.,x. . xn,u 

which  can  be  Inverted  in  terms  of  u  for  all 
(Xj.*j . xn)cEn, 

iv)  T,,T2.  •  •  . ,  Tn  are  the  state  variah)cs 
and  Tn+1  the  control  for  our  linear  system  (3), 

v)  T  •  (Tj.T.,  ....  Tn+1)  is  i  one-to-one  map¬ 
ping  of  Kn+1  to  Hn+1  taking  system  (2)  co  sys¬ 
tem  (’). 

Ke  denote  by  if.gl  che  Lie  brackets  of  our  vector 
fields  £  and  g,  (ad'f.g)  •  (f,"!f,gil,  .... 

(ad^f.g)  -  [ f , (ai^“ s f . r) 1 .  If  the  set 

fB.if.gl  ....  (adn"'f.g))  spans  an  n-almenslona ! 

space  ot  he  origin  In  V."  and  the  set 

Ig.ff.g],  .  .  .,  (ndn“*f.g)l  is  invelutlve  ;or  all 

points  rear  the  origin,  then  system  (2)  c.i"  be 
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♦ 


cransfotmed  to  system  (3)  for  all  (xl,xl,  .  .  . ,  x^) 

In  some  open  neighborhood  of  the  origin.  If  these 
conditions  on  (g.tf.g),  .  .  (adn*Tf,6))  and 

{g.If.S).  ....  (adn'2f,g)/  hold  at  every  point  in 
J'.r-,ve  give  a  construction  of  a  transformation 
T  -  (T,,Tj.  ....  Tn+l).  In  the  process  of  building 
T  we  Introduce  the  nonchar.ictcristlc  matrix  (named 
for  the  noncharacccriscic  condition  In  parclal  differ* 
ential  equations).  If  this  matrix  satisfies  the 
ratio  condition  on  Rn,  or  other  conditions  for  which 
we  can  apply  a  global  Inverse  function  theorem,  the 
transforaatlon  T  exists  for  sll  (xj.x, ,  ....  xn)cBn 
and  satisfies  properties  i)  chrough  v)  listed  above. 
For  other  results  concerning  the  transformation  from 
nonlinear  systems  to  linear  systems  we  refer  to  the 
work  of  Krcner  (13) .  Bcockece  (2),  Meyer  and  Clcolani 
(14) ,  (13) ,  and  Jakubczyk  and  Respondek  (11)  ,  and  tha 
authors  (7,8,9.10).  Our  results  depend  on  the  local 
theory  developed  in  (18).  Details,  proofs,  and  other 
theories  concerning  global  transformations  appear  in 
(7). 

Classical  results  involving  global  inverse  func¬ 
tion  theorems  ire  found  in  the  work  of  Hadaoard  (M , 
(3) ,  and  (0 ) .  Other  interesting  theories  In  this 
direction  are  attributed  to  Palai3  (16),  Berger  and 
Berger  (1) .  Uu  and  Desoer  (19) ,  Kou,  Billet,  and  Tam 
(12) ,  and  Sandberg  (17) . 

Section  II  concaine  definitions  and  tha  system 
of  psrtial  differential  equations  from  (18)  chat  we 
must  aolve  in  order  to  find  a  transformation  of  the 
desired  type.  In  section  III  we  state  our  main 
result,  give  an  outline  of  its  proof,  and  prtsanc 
several  examples  of  its  application. 

II.  DEFINITIONS 

For  V“  vector  fields  f  and  g  on  Sn  (or 
generally  on  a  differentiable  manifold),  we  define  the 
Lie  bracket 


(f.g) 


|Sf 

ax 


li 

ax 


8 


with  3g/Jx  end  3f(3x  denoting  n  x  n  Jacobian 
matrices.  Ue  also  have 

(s'-*  .*J  -  g 

(ad*f,g)  -  (f.gl 

(adlf,g)  -  (f.lf.gj] 


(ad^.g)  -  [f  .(ad*"1!  ,g)l  . 

A  collection  of  '<?"  vector  fields  fj,f,,  .  . 
fr  on  Rn  Is  called  lnvclutlve  if  functions  oijitfx) 
exist  such  char 


[fj.fjlCx)  -  p  bljlc(x)fk(x),  1  S  i,)  S  r,  i  i  i  . 

Clven  c  point  xtcr.n  ar.d  an  ir.voiutive  set 

{f.,f. . fr)  cf  vector  fields  on  Kn,  then  there 

exists  a  unique  maximal  r-dicensi anal  submani¬ 

fold  5  of  F.n  cor.rainir.g  >::  so  that  the  c.’.r.fccr.t 
to  S  at  p*i.h  ^oir.t  1a  the  *pocc  spanned 

by  . fr(x).  in  thla  ensv  S  la 

rnr.ifold  of  f.tf« . fr  throunh 

zii>  fey  the  Frebcniu*  Thecrcr,. 


Let  f  be  a  vector  field  «nd  b  a  J**** 

function.  Then  the  Lie  derivative  of  h  with  respfer 
to  f  is 

Lf(h)  -  (dh.f) 


where  dh  is  the  gradient  and  (• , •)  la  che  duality 
between  cne  forms  and  vector  (lelds.  If 


then 


(dh,f> 


+ 


For  w  a  *€m  one  form  on  Rn , 
derivative  of  u  with  respect  to  f: 


we  have  the  Lie 


l(M 


where  *  denotes  the  transpose  and  3m*/3x  and  3f/3x 
are  Jacobian  matrices. 

Our  three  Lie  derivatives  are  related  by  the 
formula 


“  (l-f(<*>)  ,g)  +  (“.If-gl)  •  (A) 

Recall  that  we  want  e  transformation  that  maps  our 
system  (2)  to  syscem  (3),  which  we  rewrite  in  the 
T  *  (TltTj,  ....  Tn4.j)  coordinates  (for  R0+<)  as 

T,  -  Tj 

i,  -  t, 


t 


n 


T 


n-M  ‘ 


From  (18) ,  necessary  and  aufficien:  condition*  for  tin* 

existence  cf  a  transformation  for  (X|,Xj . x„) 

in  some  neighborhood  of  the  origin  in  Rn  are: 

a)  The  set  (g,(f.g),  .  .  .,  (adn'lf  ,g) )  spans  an 
n-dlmensional  space,  and 

b)  The  set  (g.lf.g),  .  .  ..  (adn_2f,g))  is  tnve- 
lutlve , 

both  In  a  neighborhood  of  the  origin.  A  tranl or tut i mi 
T  must  satisfy  the  partial  differential  equations. 

(dTj.g)  -  0,  1  -  1.2,  .  .  .,  n  -  1  ) 

(dlj.f)  -  LfCTj)  -  T1+1.  1  -  1.2 . n  -  l| 

<dT„.f  +  ug>  -  Lf+ug(Tn)  -  Tn+l  J 

(3) 


as  shown  In  (Up . 

UsinR  the  furmuli 


(t-r,  ,udkr  ,g)>  - 

(•*"„•  f  T  UK>  - 


(A)  these  cqu.:t  J.-ms 
0,  k.  -  0,1  ....  * 

T 

‘  n+i 


<M 
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The  second  equation  In  (6)  is  Che  ante  ns 

(-l>n‘,<dT1.(odn-,f.g)>u  *  <dTn.r>  -  T>+1  . 

Thus  co  find  a  t rnna format iop  hi*  thru.  enmiJ  I  Ioiih 
1)  c  trough  v)  an  given  in  the  InLriKiuct  Ion  hold,  wr 
nu3t  find  a  solution  Tj  of 


(dTl,(adkf,s))  -  0.  k  -  0,1,  ....  n  -  2 
(dT,,(adn“'f,g))  i»0 


(7) 


that  vanishes  at  the  origin.  The  functions 

T2,T, . Tn+l  are  then  easily  derived  froa 

equations  (5) 

The  local  theory  gives  us  a  irons formation  which 
Is  applicable  co  a  neighborhood  of  the  origin  In  Kn, 
but  gives  us  no  Idea  of  Che  size  of  Che  neighborhood. 
Next  we  construct  a  solution  Tl  and  Introduce  condi¬ 
tions  under  which  the  transformation 
(Tj.Tj.  ....  Tn+l)  Is  global. 

III.  CONSTRUCTION  OK  THE  MAPPTNC 

Ve  begin  this  section  by  building  a  function  T2 
which  satisfies  equations  (7).  The  first  (n  -  1) 
equations  are  partial  differencial  equations  chat  are 
solved  by  reducing  co  ordinary  differential  equations. 

Parameters  c1,c2I  ....  cn_1  are  introduced  as 
follows.  For  all  tjtR  we  solve 


dx 

dt, 


(adn_1f.g) 


with  initial  conditions  x(0)  «  0  co  find  Che  Inte¬ 
gral  curve  of  (adn'1f.g)  through  the  origin.  For 
every  t,cF  we  examine 


dx 

dt. 


(adn",f,g) 


Wt,). 

satisfying  x(tj.O)  » fir.  Continuing,  we  solve  for  all 
t|tX  the  equation 


dx 

dt. 


(adn',f.g) 


with  x<C1.t2,0)  -x(tj,t2).  This  argument  Is 
repeatad  until  Che  final  step  Is  reached,  solving 


dx 

dc_ 


8 


with  the  initial  conditions 

x(c2,c2,  •  .  .,  t^j.Ol  —  x(t^,c2,  ■ 

Thus  we  have  a  function 

(tj,t2».  •  • « cn)  —  (Xjftj.tj,  .  .  .  ,t„)  ,x2  (t2  ,t2 


cn-i^ 


which  has  as  its  Jacobian  matrix  the  noncharacte. Istic 
tat  rix 


u , 


. 

M  . 


•Ix, 


3x  '  3x 
n  n 

at. 


n 


(^ ) 


n  By  design  this  matrix  evaluated  at  the  urif.Xn  in 
K  Is  the  same  an  the  matrix  with  columns 
(adn~lf ,g) , (adn“2f ,g) ,  .  .  g  evaluated  at 
(0,0,  .  .  ..0).  This  last  matrix  Is  nonsingular,  so 
by  the  inverse  function  theorem  there  is  an  open 
neighborhood  of  the  origin  on  which  we  can  solve  for 
tl,t2,  .  .  . »  tn  as  functions  of  x,,x2,  .  .  .,  x_. 

If  we  can  find  a  solution  of  equation  (7)  ns 

a  function  of  tl,t2,  .  .  .,  tn  wc  have  a  solution  in 
the  x  variables  also. 

We  now  use  our  assumption  chat  g,[f,g],  .  . 
(adn“2f,g)  are  involutive.  The  Frobcnius  Theorem 
tolls  us  that  if  we  fix  the  parameter  and  let 


tj.c,,  . 

g*(f*gl.  .  .  ..  (ad 
crans format ion  is  not  unique)  for 


ary  ve  gee  an  integral  manifold  of 
n*2f,g).  An  obvious  choice  (our 


iS  Tt 


This  funecion  is  constant  on  each  integral  manifold  of 
t.ff.g).  .  •  ..  (adn-,£,g)  and 

JTj  5T,  3T, 

• 

Hence  the  first  (n  -  I )  equations  In  uqtiallons  (7) 
lio  Id . 

Suppose  ^dT,,(.vl"”  f ,  g))  *0.  Since  T,  «  I  , 
wc  hove  dTj  i  0,  and  (<i<ln-lf ,g)  +  0  al  the  orlxln 
Implies  that  (adn_1f,g)  is  tangent  to  the  Integral 

manifold  of  g,[f,g) . (ndn-*f,g)  at 

(0,0 . 0)  ,  a  contradiction  to  the  assumption 

that  g.lf.g),  .  .  (ad"”1f,g)  are  lineccly  Indepen¬ 

dent  there. 

Hence  there  Is  an  open  neighborhood  of  che  origin 
on  which  conditions  1)  through  v)  hold.  Note  chat 
Tj  •  c2  and  the  napping  which  has  che  noncharxcterls- 
tic  matrix  as  Its  Jacobian  macrlx  caps  che  origin  to 
the  origin,  implying  that  T(0)  •  0. 

The  function  T.  satisfies 


(dTl,Udkf,g)>  -  0.  k  -  0.1, 


and 


Hi 


3T, 


Hi 

3t„ 


Likewise  T2  is  a  solution  of 

(dT2 ,  (adkf  ,g))  »0,  k  -  0,J 
(use  formula  (4)),  and 


ill 

H, 


:»Tr 

~ 


•  0 


3T. 

J*t, 


Continuing  In  this  way  uc  have  T,  Independent,  of 
tj.tj,  .  .  cn.  and  In  general  does  noc  depend 

on  . tn  for  1  <  1  S  n  -  1.  Since 

Tj.Tj,  .  .  . ,  Tn  arc  linearly  Independent  and  their 
Jacobian  matrix  with  respect  to  tj.tj,  .  .  . ,  tn 
has  all  entries  above  the  diagonal  zero,  the  napping 
CT, ,r2 ,  ....  Tn)  is  one-to-one  as  a  function  in  the 
t  variables. 

Hence,  wherever  assumptions  a)  and  b)  hold,  the 
only  possible  obstruction  to  constructing  a  transfor¬ 
mation  T  -  (Tx.Tj . TVh)  with  the  desired 

properties  is  contained  in  the  noacharacteriatlc 
matrix.  Our  global  result#  depend  on  chis  matrix. 

The  first  theorem  that  we  need  Is  found  In  (12). 

Theorem  3. 1 

Suppose  there  Is  a  map  -»  Rn  which  is  dlf- 

ferenclable  wlch  Jacobian  matrix  J(x).  If  there 
exists  a  constant  e  >  0  such  that  Che  absolute 
values  of  Che  leading  principal  minors  1], A, ..... 
of  J(x)  satisfy 


for  all  xcR,  then  H  is  one-to-one  from  R  onto 
Ra. 

The  condition  stated  on  the  absolute  values  of 
leading  principal  minors  is  called  tha  ratio  condition. 
Our  first  result  Is  proved  by  applying  Theorem  3.1, 
the  construction  of  our  transformation,  and  Che  com- 
satnts  made  during  and  after  Che  construction. 

Theorem  3.2 

For  system  (2)  assume  t lint  the  set 
{g.If.g],  ....  (adn'1f ,g) i  spans  an  n-dlmenaionnl 
apcce  ac  each  point  of  Rn  and  the  sac 
Ig.if.g].  •  •  ••  (adn-Jf,g))  is  lnvolutive  on  Rn. 

If  the  noncharacterlsclc  matrix  (8)  satisfies  the 
ratio  condition  on  Rn.  then  c here  exists  a  ff“ 
tranafortsatlon  T  •  (Tj.Tj.  .  .  Tn^  with  the  fol¬ 
lowing  properties. 

1)  T(0)  -  0, 

liVTj.T, ,  .  .  . ,  Tn  are  functions,  of 

Xj.Xj . only  and  have  a  nonsingular  Jacobian 

matrix  on  Rn , 

111)  Tn+1  Is  a  function  xltx2,  ....  xn,u 
which  can  b*  invented  In  terras  of  u  for  all 
(X.,xf,  ....  xn)eRn, 

lv)  T,,TV,  .  .  . i  Tn  »re  the  state  variables 
and  Tn+]  the  control  (or  our  llncnr  nyr.tem  (3), 

v)  T  *  (T[,Tj . Tn+1)  ls  3  one-to-one  map¬ 

ping  of  Kn+1  to  Rn+l  taking  system  (2)  to  system  (3). 

S-nasr-le  3.  3 

We  take  the  nonlinear  system  on  R* 


M  r  xi2  *  + 


+  u  •  f(x(t))  +■  ug(x(t)). 

L’J 


-CeK*  +  1) 

r.gi  - 

o 


is  linearly  Independent  f rera  g  oh  F"  ,  and  the  in  -.■* 
lutlvc  assumption  on  k  Is  trivial  for  two  d irons I oris . 

Solving  dxj/dl  j  »  -(?*'  ♦  l).  dxr/<Jt,  -  0  with 
Xi(0)  ■  0  and  x?(0)  »  0  we  have  Xj  ■  -2lj  and 
x7  *  0.  Examining  dx,/dt,  -  0  and  dxm/dL*  •  i  with 
Initial  conditions  x1(t1(0)  ■  -2t j t  and  -  0 

wc  find  x^  -  -2tj*  and  x2  ■  t7.  In  thin  c. ikc  the 
nonchar acccrlstic  matrix  Ik 


-2  0 

0  1 


which  fulfills  the  ratio  condition^  with  e  *  1/2. 
Hence  our  transformation 

<T,.T,.T,>  -  (-  T.  -  i  (t  ♦  «“*  ♦ 

- 1  *1  (1  s’  +  ux?  +  *•)  +{*' 

-  J  {ex2  +  I)  (x,  *  +  u)^ 

is  defined  on  all  of  R2  and  has  properties  1) 
through  v). 

The  proof  of  the  following  corollary  depends  on 
results  like  Theorem  3.1  from  (12). 


Our  first  Lie  bracket 


Corollary  3. A 

Suppose  f  K  •  1  f  .  )*.  I  *  •  •  ..  (ad  *  f.K)l  spans  an 

n-d  I mens  J  non  I  space  **«>  Kn  and  f  r. .  |  f  .  k  I . 

(adn“?f,g)}  Lm  liwulut  Ivi*  there.  If  fix*  anNiimpt  l«»ti  •  *,, 
the  ratio  condition  for  the  noncharact cr I  hi  I c  matrix 
in  Theorem  3.1  ls  replaced  by  any  of  the  following 
hypotheses,  then  the  conclusions  of  that  theorem 
remain  valid. 

1)  There  exists  an  n  *  n  nonsingular  constant 
matrix  A  such  chat  A  multiplied  on  the  rl*ht  by 
the  noncharacterlsclc  matrix  satisfies  the  ratio  con¬ 
dition  on  Rn. 

2)  The  noncharacturistlc  matrix  (with  a  possible 
premul  tipllcation  by  an  n  *  n  non?;  I  uvular  const  an* 
matrix  A)  is  positive  definite  on  R*1. 

3)  The  determinant  of  the  mmclia  racier  t.st  ie  matrix 
is  positive  on  Kn  and  the  sura  of  the  non.-haraclcr  i 
tic  matrix  and  its  adjoint  has  nun -ne pul  1 ve  principal 
minors  for  all  xtR,T.  In  this  case,  the  noncharaet  rr* 
lailc:  matrix  may  he  p  reran  1 1  1  p  I  led  hy  an  <•  *  n  Holi- 
sinful  ar  constant  matrix  A  as  before. 

Often  wc  cannot  construct  •»  l  rans  I  orm.it  ion  mo  all 
of  Fr*  but  arc  able  lo  do  so  on  same  fixed  open  sub¬ 
set  U  of  Rn.  In  this  direction  wc  consider  the 
following  example. 

Example  3.5 

Let 

“in  Tsir.  x2"  "0“ 

x2  •  sin  >:3  +  u  0  ■  f(x(t))  +  u(t)gt*x(t)) 

x?  L  0  J 


U  -  (x1,r..lxl)g{l 


I 


Our  Lie  bracket*  are 


£f.g) 


"  0 

-COS  X 

0 


and 


[f.lf.gll  *  (*dJf.g> 


cos  x4  cos  x,~ 
0 
0 


which  together  with  g  apan  a  three-dimensional  spaca 
on  U.  Also 


Ig.U.g]} 


0 

sin  x, 
0  . 


implying  that  g  and  (f,g]  are  lnvolutlve  on  U. 
Solving 


dx 

dtt 


(adzf.g). 


dx 

dc2 


jet 


K 


In  ordar  and  wlih  the  proper  Initial  conditions  we 
find  tnat  xi(ti,t,,t,)  »  ti,*j(tl,t,.t,)  «-(coa  c.)t,, 
and  x,(Cj,t2,t,)  «  t,.  Solving  lor  t^tj.t,  aa 
functions  of  x, ,xt.x,  wo  obtain  tt  ■  x1(t2  -  xs, 
and  t,  *  x,/(-cos  x,5,  all  valid  on  U.  The  trans¬ 
formation 


Ti'“ 

Tt  «  sin  Xj 


T,  *  cos  x,  aln  x, 
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T,  -  -eia  X,  eio1  x,  ♦  (co#  x2  cos  x,)u 


hold#  for  all  (Xj.Xj.x,)  In  U. 


CONCLUSIONS 

He  have  given  a  theorem  and  its  corollary  In 
which  v«  acate  conditions  for  having  a  global  trans¬ 
formation  of  a  nonlinear  system  to  a  linear  system. 

In  doing  so  ve  gave  an  explicit  method  for  constructing 
Such  a  transformation  and  introduced  the  important 
noncharacceriacia  matrix.  Examples  that  illustrated 
our  technique  were  preseuted. 
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IKTRDDUCTIOH 

-  By  means  of  cute  space  coordinate  changes  and 
feedback,  a  controllable  linear  system  can  be  trans- 
forsed  into  decoupled  series  of  integrators,  that  is 


“  U1 


xkj  +  I  “  *1^  +  2 '  *1^  +  2  "  xkj  +  »»  •  •  •» 

ikl+kJ-uf 


Xkj  +  .  .  •  +  Vyg.  | 


1  “  XkA  + 


+  Knx  *  ** 


This  particular  form  is  sometimes  called  the  Brunovsky 
canceled  fern.  It  Is  also  well  known  that  the  orders 
of  these  series  cf  integrators  are  invariant  -mder  the 
trc.-:«:"o  — atior.s  (2,  I);  they  are  usually  called  the 
Krcsecktr  incites. 

In  the  literature,  there  have  been  some  efforts 
cade  to  generalize  these  linear  results  to  nonlinear 
systems.  Meyer  and  Clcoiani  (1)  showed  chat  the  class 
of  block-triangular  systems  can  be  transformed  into 
the  i  run  tv  city  canonical  form.  Brocket!  (#)  discovered 
an  important  class  of  Invariants  (discussed  later)  and 
alt:  gave  a  necessary  and  sufficient  condition  for  a 
tlr.tli.-ir.tui  nonlinear  rye  ice  to  be  transformed  into  t 
cir.tlo  series  cf  integrators.  By  enlarging  the  sec  of 
crin* fr-nttirns ,  Su  (£)  obtained  a  still  larger  equiv¬ 
alence  tits*.  Later  in  (f)  the  authors  extended  the 
tit.lti  t.  ,  1-1.1  : r motions  and  also  gave  a  way 
o:  t:r.=  trurttr.g  then. 


The  goal  of  this  paper  is  to  generalise  the  pre¬ 
vious  results  to  obtain  a  characterization  of  the 
class  of  linear  equivalents  of  multiple-input  non¬ 
linear  systems.  Recently  the  authors  were  Informed  by 
Professor  Z.  Sontag  about  similar  results  obtained  by 
Jakubexyk  and  Rospondek  (?);  we  shall  briefly  discuss 
those  results  in  the  final  section. 

^-EQUIVALENCE 

He  consider  nonlinear  systems  In  Kn  of  the  fora 
x  -  F(x,  ut,  .  .  . ,  u,)  (1) 

where  F  defines  a  fm  vector  field  and  F(0)  »  0. 

A  F-traisfcms-iier.  is  defined  to  be  a  V  * 
def f eoDorphism  which  preserves  the  origin,  t.  system 
is  said  to  be  S'-relcted  to  another  system  1 , 
on  a  neighborhood  U  C  F.n-r=  If  there  is  a 
.f -transformation  T  *  (I,,  T;,  ....  T,..^)  such 
that  for  any  state  and  control  trajectory 

(x,(t) . x^tt),  •  •  ••  u-(tj)  in  U  of 

£j.  the  image  TGcjU),  ....  :n,(t) ,  ujCti . 

u_(t>)  corresponds  to  a  state  ard  control  trajectory 

of  the  system  1-,  with  Tn+1 . 7n+-  being  the 

controls  of  r2.  By  arguments  siailar  to  chcsa  is  Uj 
vt  have  the  following  results. 

Prorosltlon  1 

A  system  x  •  F(x,  u2 ,  .  .  . ,  u_,)  is  ^-related 
to  a  system  y  -  G(y,  v. .  ,  .  .  v=)  if  c-.d  only  if 

there  is  a  transform?  tweo  X  •  (T . . Tn,  1.,*, 

Ir<+a)  such  that 

1.  Tj . Tn  are  independent  of  the  con¬ 
trols  u. . u_ 

2.  The  Jacobi.vn 

f  *  *»— ;  1 
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is  Den? tabular  near  the  origin  of  Rn+*  and 

3.  T  satisfies  the  system  of  partial  differ¬ 
ential  equations 
n 

rJ '  Cl'r  •  . . 

l-i 

vhere  F  -  { Tx . Fn) .  G  -  (Cj . C*) ,  end 

C^*T  denotes  the  cosposition. 

Preposition  2 

The  ^relation  is  an  equivalence  relation. 

Vlth  this  equivalence  relation  ve  now  examine 
the  equivalence  classes  that  contain  the  controllable 
linear  systems.  It  is  clear  that  a  nonlinear  system 
Chit  Is  ^-equivalent  to  a  controllable  linear  sys¬ 
tem  is  also  S’ -equivalent  to  the  Brunov sky  canonical 
form  of  the  linear  system.  More  precisely,  for  such 
a  nonlinear  system 

”  FiOtj*  ....  Xq,  Uj,  .  .  .,  %)  , 

1  —  1,  . ,  n 


it  Is  associated  with  a  transforms  don 
T  —  (Tj,  .  .  . ,  Tg,  ,  •  •  . ,  Tpaq)  and  a  set  of 
'a  positive  integers  (k1#.  .  .  1c,)  such  that  by 

letting 


t-o 


vith  k0  -  0,  v«  hive 

3T, 


atiH 


\  ®”i 

Ti(x’ u* . 

3-i 


“  *i+a 


V‘ 


+  l<i<«p-l 


V;t*p 

/  ,  a*,  r3(x’  ui 
3-i 


•  *  •  •  * 


«a>  -  *, 


o+p 


(2) 


(3) 


vhere  p  -  1 . n.  Observe  that  for  each  fixed 

state  x  €  Ra,  Eq.  (2)  is  a  system  of  n  -  a  linear 
equations  vith  constant  coefficients  satisfied  by 

F. . Fn.  A3  an  extension  of  the  result  in  (5), 

vc  have  the  following  necessary  condition. 


In  this  section  ve  show  that  the  invariants  dis¬ 
covered  by  Erockett  (<)  are  else  invariants  under  ou# 
^-transformations.  First  ve  need  the  foilcvir.g 
observation. 

✓ 

Observation  1 

If  a  system  £:x  •  F(x,  u, . uB)  is 

^-equivalent  to  s  system  of  the  form 
/ 

a 

X  -  f(x)  -  £  ci(x)«i(x,  ux,  .  .  , 

i-1 

then  £  aust  take  the  siailat  fora  as 

A  i 

f  -  f  (9)  -  2  *i<y)*i<y.  vx . va^  • 

i-i 

This  can  be  verified  by  simple  computations  which  ve 
leave  to  the  reader.  Vith  this  fact  ve  can  now 
generalize  Erockett1 s  results. 

let  Lit j  be  the  linear  span  of  all  the  vec¬ 
tor  fields'  *  that  are  lie  derivatives  of 
f,  gx,  .  .  . ,  gB  vith  the  total  degree  of  tj>  vith 
respect  to  f  being  less  than  or  equal  to  i  sad 
the  total  degree  of  V  with  respect  to  the  g'e 
being  less  than  or  equal  to  3.  For  example 

1*1,1  -  liaeer  span  {f.  gt . g„,  (edlf.  g,), 

....  (adlf,  gjj),  (ad*f ,  gx) . (sd*f,  Sq)  }  . 

Here  the  Lie  derivatives  are  defined  inductively  as 

s>  -  c.  (*dif ,  c 

and 

(sd^.  g)  -  (ad^.-Cad1-^,  g»  , 

where 

2L  ii 

3x  '  «x 

are  the  Jacobian  matrices. 

Let  be  the  subspace  {p|p  •  h(0),  b  €  LXij) 


Theorem  2 

The  dinensions  of  S^.,  ire  5"- invariant. 


Thecrem  1 

If  s  system  x<  -  F^Cx,  ux ,  .  .  u-)  , 

1  -  1,  ....  a  is  S'  -equivalent  to  a  controllable 
linear  system,  then  F  -  (F.  ,  .  .  . ,  F_)  Bust  take 
the  form 

n 

“i  *  Sij  fx)dj  (x,u) 

j-l 

fit  all  i-1,  .  .  .,  n,  vhere  3,  (0,0)  -  0  end  the 

Jacolia *  c(t: . S^J/iCuj,  ,J.  .,  Ug)  is 

locally  neasicgular. 

Since  the  ror'.sien-nt  of  the  scalar  finctloos 

e: . i~  hy  a  set  of  new  controls  v.  ,  .  .  .,  v- 

i»  a  leg.Ciz.*;*  rent m tier  .  ve  snail  .iac-ms*  s 

tct.mt.:  r  stem  .1.  ■  cm  : ? mi  c:  ,:i  3  •  . 


The  proof  is  omitted,  but  ve  remark  that  essen¬ 
tially  Erockett 's  proof  in  (4)  v  ori-s  in  our  case. 

Kov  ve  examine  sens  import*.. t  dimensions  of  the 
which  ve  need  in  characterizing  the  equivalence 
class  of  interest  to  us. 

Civen  a  cyztem  (f,  gj . g^.)  in  F.n  vc 

construct  e  matrix  ef  vector  fields 


nr- 

61  1 

i  *=  : 

■  (ui1!.  S.i 

1  -- 

t.w*  f  ,  g.)  ! 

1  :sc  f,  t.l 

• 

1 

1  * 

!  ; 

'*5a*;?a  2,' 

AC~’ 1 '  .  C .  . 
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Lat  5X  denote  the  1th  row  of  the  aatrix.  la 
terns  of  Li  <  we  have  Le,x  *  span  (B0), 

Lj  ,  •  spaa  { 3 e ,  Bt},  etc.  Ue  asause  In  the  rest  of 
the  piper  that  the  dimension  of  Li.j  Is  constant  on 
cone  neighborhood  and  define  the  following  indices. 

1.  Indices  a  -  (e# . Oq-i) 

oA  «  din  Si,  i  for  0  <  1  <  n  -  1 

2.  Indices  B  •  (S# . .  £n_j) 

-  aQ.  and  0^  •  a.l  -  a*.x 
for  1  <  I  <  n  -  1 


n 

Z 


^7  8V°': 


+  i<i< 


*-p  - 1.  i  <  p  < 


Z^'>‘ 

i-i 


■i+i  • 


(6) 


(7) 


3.  Indices  y  “  (yi 
t 


Yt  •  #i  -  8t+1 


Indices  k  *  (k1 ,  . 

k^  -  the  number  of  8j 


Tj^j) 

for  0  S  i  S  n  -  1 
with  Bo  -  0 

V 

'  "s 


The  nenslngularlty  of  the  transformation  T  in  turn 
implies  chat  the  matrix 


with  Bj  >  i 

Clearly,  by  way  of  construction,'  the  Indices 
( o( ,  oj,  ,  .  .,  oT1_1)  are  an  Increasing  sequence. 

By  computation,  it  also  can  be  shown  that 

(S(>  3X . So-j)  is  always  a  decreasing  sequence, 

and,  thus,  Ti  1  0  for  all  1.  In  this  paper  we  are 
Boat  interested  la  the  case  1a  which  as  -  m  aad 
Ojj.j  •  n.  If  this  is  the  case,  one  has 

o-l 

1-0 

and  . 

m 

J  ki  -  n  . 
i-i 

MAIM  RESULTS 

9s  continue  the  development  begun  in  the  second 
section  (^-equivalence).  Suppose  a  eystea. 

(f,  g. . Sc)  la  3 -equivalent  to  a  controllable 

linear  system  associated  with  a  set  of  (Kronecker) 

indices  (V, . kg)  where  kx . k«  are 

positive  integers  aad 

X)  k.  •  n  . 
l-i  1 

Equations  (2)  aad  (3)  then  become 
n 


n  n  i 

\K 


Z-r  3*j  8lJ  Z_/  SxJ  *“ j 

J-i  '  j*i 


(8) 


is  nonaingular. 

Recognizing  that  the  surma  cion 


n 

z 

1-1 


»v 


can  be  expressed  es  a  duality  product  of  ferns  snd 
vector  fields,  Eqs.  (6),  (7),  and  (E)  are  equivalently 
rewritten  as 


and 


Z  .alj  (\J  *  s»jui  +  •  •  '  * 

•p-i  ♦  1  <  i  S  *p  -  l 


J*- 


Ti+i  •! 

f(4) 


(9) 

(1C) 

(11) 


end 


\  -Ze 


/  77“  (\'  +  +  •  •  •  +  «bj«b>  *  ^  • 

fr'  J  (5) 

vhcrc  1  <’*,<•  tr.i 

w 

kt 

L*  0 

Sir  it  .bt  rirbi-'-tr  i  Airies  ! 
i  '  ‘  irilTll;  .  •  <i  *.r. 


vi :  * 
4  :* 


(«!»  8k)  "  0  *  1  <  k  «  n,  1 

*p-i  +  1  S  *  S  *p  -  1.  1  ‘P<e| 

(dT1,  f)-  T1+1 

"<««>•  8=)  •  •  •  <«V  Sm>" 

(dT*  -  Sr.)-  •  •  <«.  •  6=) 

U  u  i  B 

i*  cons  insular,  where  d^  is  the  /erre 

Oil  2Ii\ 

v»*» . * 

tad  the  disliry  product  (dT^,  f)  lc 
n 

Yiii . 

dL-r  J 

J*i 

Using  the  wel’-te.o-.v.  ferrule 

(ti'iT,  f),  g)  -  d:cT,  g5  “(dT,  (ad1!,  g))  .02; 

pre:»:;.-.v  si.«:ussicr.  re;.l:s  In  the  f  illowi:.; 

theorem. 
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Theorem  3 

A  system  (f.  g> . g0)  1*  ^-equivalent  to 

a  controllable  linear  cyetes  if  and  only  if  there  is 
a  set  of  positive  integers  kj,  k.,  .  .  .,  kg  vith 

n 

23  ki  ■ n  * 

i-i 

and  a  scalar  functions 


V  +  i*  Ta}  +  x>  * 

such  that 

1.  The  forms  dTa  +  Jt  . 
linearly  Independent ,  1 


-  Ts. 


+  l 


dT 


*»  +  1 


2.  (dT,  +  j.  (ad1!,  g3))  ■O.lfis*, 


0  <  i  < 


*p+l  ”  2» 


3.  The  matrix 


»•  <•*  ***  '•  s,)> 


<«.,  <»<  1  '•  «•>> 

"Ws-1.  %\ 


(si, 


Im4 


0  . 


is  nonsingular,  where 

ap  -  ^  k£  ,  1  <  P  <  a,  k. 

Therefore,  the  problem  of  the  existence  of  a 
transformation  T  becoaies  the  existence  of  a  posi¬ 
tive  integers  and  s  linearly  independent  exact  on  ef¬ 
forts  vLich  satisfy  the  above  conditions.  Once  these 
s  one- forms  are  obtained,  the  rest  of  the  transforma¬ 
tion  can  be  constructed  by  Eqs.  (10).  Ue  remark  that 
in  the  proof  of  the  above  theore=  one  has  to  show  that 
a  transformation  so  constructed  in  indeed  nonsingular. 
For  details  the  reader  is  referred  to  our  psper  (8 ) . 

Hext  we  come  to  another  main  theorem  which  gives 
conditions  on  the  characteristics  of  a  system  (includ¬ 
ing  the  invariant  indices  discussed  in  the  preceding 
section  —  ^-invariants)  for  being  ^-equivalent  to 
a  linear  system. 

Thecres  4 

a  s  vs  ten  (f,  g. ,  .  .  .,  gg)  is  ^-equivalent  to 
s  controllable  linear  system  with  a  controls  if  and 
only  if 

1.  a,  -  c.  3;,.!  «  r. 

2.  For  each  i  such  thcr  y.  r  0,  the  set  of 
vector  fields 


•  i  — 2 

‘  •  •  •  So*  *•••  ^  *  6j  )  • 

is  frvclutivt. 


(adi_lf,  ga)} 


Now  suppose  ij  -  i.  >2,  end  uij,  .  .  •  •  “y 

art  needed  ocs-foms  that  vanish  cn  the  integral 
nar.il cl;  associated  with  1,,  r.anely,  the  Integral 
•_*ni:.-ld  ef  li  -1,1.  One  step  of  Lie  detivstlve 

flv(t  forms  L/ (.-,)■  ....  LeUj)  which,  by 
definition  of  ■_ . .,  ,  have  tero  duality 


. . .  '  ’* . **"'  _1x  *  ,,1‘ 


theorem. 


is  involutive. 


Theorem  5 

If  a  system  (f,  g, . gg)  satisfies  the  con¬ 
ditions  in  Theorea  1,  namely.,  , 

1.  c,  -  m,  aa_j  •  a 

2.  For  each  i  such  that  ti  i  0.  Lj  _  t  1 

is  involutive  i  * 

then  'L3<1  is  involutive  for  every  0  <  J  <  ix  -  1. 

CONCLUDING  REMARK^ 

In  this  paper  we  have  obtained  a  complete  charac¬ 
terisation  of  these  nonlinear  systems  which  are 
^■-equivalent  to  a  controllable  linear  system.  These 
nonlinear  systems  are  grouped  into  difference  equiv¬ 
alence  . classes  Indexed  by  invariant  indices. 

The  conditions  stated  in  Theorem  4  are  weaker 
than  those  in  the  paper  by  Jakubcryk  and  Retpondek  (?) . 
Sene  of  their  conditions  are  redundant,  as  Indicated 
by  our  Theorem  3.  Also  their  proof  is  an  existence 
proof,  whereas  ours  is  constructive  in  nature.  Details 
and  explicit  construction  of  the  desired  transforma¬ 
tions  are  omitted  here.  They  will  appear  in  our 
paper  (5) . 
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CONTROLLABILITY  OF  NONLINEAR  HYPERSURFACE  SYSTEMS 
L.  R.  Hunt* 


ABSTRACT.  Consider  the  nonlinear  system 
n-1 

x(t)  *  f(x(t))  +  £  Mt)a-(x(t)),x(o) 
i*l  1  1 

*  X  £  M 
0 

where  M  is  a  connected  real -analytic  n- 
dimensional  manifold,  f.g-j . gn_^  are 

real-analytic  vector  fields  on  M,  and 
u^t...,u  j  are  real -valued  controls.  Ue 

are  interested  in  characterizing  the  largest 
open  subset  U  of  M,  if  any,  which  is 
reachable  from  x  and  which  we  call  the 

region  of  reachability  of  our  system  from  xQ. 

If  the  Lie  algebra  LA  generated  by 

f ,g^ ....  ,gn_i  and  successive  Lie  brackets 

has  vector  soace  dimension  n  at  x  .  and 

o 

if  f,g^,...,gn^  are  linearly  independent 

at  some  point  in  M,  we  find  the  region  of 
reachability  from  xQ.  Suppose  U  is  the 

smallest  open  subset  of  M  with  xQ  £  0 

so  that  ?U  contains  the  integral  man.' folds 
of  the  Lie  algebra  L'A  generated  by 

g^,...,g  1  that  intersect  it  and  f  assigns 
vectors  on  ?U  which  point  in  the  direction 
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of  0.  Then  U  is  the  region  of  reachability 
from  xQ  for  our  system.  Much  of  the  work'is 

involved  in  proving  a  similar  result  in  the 
more  general  Vm  case  under  the  stronger 
assumption  that  f ,g^ . ,gn_1  are  linearly 

independent  on  the  connected  <gm  n-dimen- 
sional  manifold  M. 


1.  INTRODUCTION.  Let  M  be  a  connected  real -analytic  n- 
dimensional  manifold,  f  ,g^ . .  ,gn.1  be  vector  fields  on 

M,  and  u.| . u^  be  real-valued  controls.  The  system 

n-1 

x(t)  =  f { x( t ) )  +  £  u,(t)g.;(x(t)),  x(o)  «  x  £  M  , 
i-1  1  1  0 


is  called  a  hypersurface  system  since  the  number  of  controls  is 
one  less  than  the  dimension  of  the  manifold  M.  We  assume  that 
the  vector  space  dimension  of  the  Lie  algebra  Lft  generated  by 

f,g1 . gn_1  and  successive  Lie  brackets  is  n  at  xQ  (i.e. 

this  Lie  algebra  spans  the  tangent  space  to  M  at  xQ)  and  that 
f ,g1 , . . . ,g  i  are  linear  independent  at  some  point  of  M.  Re¬ 
sults  due  to  Sussmann  and  Jurdjevic  [18]  and  Krener  [15]  show 
that  we  can  reach  a  nonempty  open  subset  of  M  from  x0.  Let 
U  be  the  largest  open  subset  which  is  reachable  for  our  system 
from  xQ.  We  characterize  U  by  proving  that  U  is  the  small¬ 
est  open  subset  of  M  with  xQ  €  0  (the  closure  of  U  in  M) 
satisfying  i)  3U  contains  the  integral  manifolds  which  inter¬ 
sect  it  of  the  Lie  algebra  L'A  generated  by  g-,,...,gn_1  and 
successive  Lie  brackets  and  ii)  the  vector  field  f  points  in 
the  direction  of  0  on  ?U.  This  set  U  is  called  the  region 
of  -eecnafiity  (r om  x_,  and  if  j  *  M,  the  system  ‘s  control- 
lafcle  r  x  . 

"he  real -analytic  theory  depends  on  results  provec  for  a  sys- 


n-1 


£  u.  ( t)g.  (x(  t) ),  x ( o } 
;’l  ’  1 


wire 


f  t  j 


,c  ,  are 
■  n-  \ 


as 

v 


linearly  independent  vector  f-'e1ds 


112 


HYPERSURFACE  SYSTEMS 


on  a  connected  V*  n-dimensional  manifold  M. '  For  this  case 
we  show  that  the  region  of  reachability  U  is  the  smallest  open 
subset  of  M  with  xfl  £  U  satisfying  U  is  an  (n-l)-dimen- 

sional  integral  manifold  of  g1,...,gn_1  and  f  assigns  vectors 
on  3U  in  the  direction  of  U..  We  view  the  control  problem  much 
as  one  would  the  famous  Holmgren's  Uniqueness  Theorem  (see  [4]) 
of  partial  differential  equations.  A  solution  of  the  partial 
differential  equation  is  unique  up  to  the  characteristics.  A 
similar  situation  occurs  in  the  study  of  uniqueness  of  analytic 
continuation  for  the  CR-functions  on  a  9fm  real  hypersurface  in 
In,  n  >  1  [11].  In  this  instance  one  gets  uniqueness  up  to  the 

characteristics  of  the  tangential  CR-equations,  which  are  the 
integral  manifolds  of  a  subbundle  of  the  tangent  bundle  to  the 
hypersurface  of  codimension  1.  If  f ,g^ ,. . . ,gn_1  in  our  system 
are  linearly  independent  on  M,  then  g1,...,gn_1  give  us  a 
subbundle  of  the  tangent  bundle  of  codimension  1.  Thus  the  only 
possible  way  to  have  a  set  which  is  not  reachable  from  xQ  is  to 
have  it  disconnected  in  some  fashion  from  the  reachable  set  by 
an  integral  manifold  of  9] »• •• »g„„i •  Of  course  such  integral 
manifolds  may  not  even  exist  in  which  case  we  expect  controllabil¬ 
ity  of  the  system  from  xQ.  Michael  Freeman  has  results  giving 
conditions  at  a  point  for  there  to  be  an  integral  manifold  of  a 
collection  of  real-analytic  vector  fields  through  that  point. 

A  nice  expository  paper  containing  the  problems  considered  in 
this  article  has  been  written  by  Srockett  [ 1 j .  Related  results 
can  be  found  in  the  work  of  Krener  [15],  lobry  [16],  [17],  Suss- 
msnn  and  Jurdjevie  [IS],  and  the  author  [12],  [13].  Theorems 
concerning  cne  oroblem  of  local  control’ ability  along  a  reference 
trajectory  are  due  to  Hermes  [7],  [8],  [9].  Results  cealing  with 
control  theory  for  linear  systems  are  in  [14],  This  paper  is 
arranaed  in  the  following  way.  In  section  2  we  give  definitions 
and  a  relevant  example.  Section  3  contains  a  local  tnecry  con¬ 
cerning  the  boundary  of  the  region  of  reachability  U  of  cur  V 
system  under  the  assumption  that  the  boundary  is  t'1  near  one 
of  its  points.  In  section  4  we  state  a  tneorem  from  [11] 
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concerning  a  subbundle  of  the  tangent  bundle  to  M  and  allowing 
us  to  remove  the  restriction.  Then  we  prove  our  main  result 
for  the  v"  case  and  give  several  applications.  Section  S  con¬ 
tains  our  result  for  the  real-analytic  system  on  a  real-analytic 
manifold. 

2.  DEFINITIONS.  We  shall  use  the  classical  Frobenius  Theorem 
and  Chow's  Theorem  [2].  For  a  statement  of  these  results  and 
their  applications  to  control  theory  we  refer  the  reader  to  [1]. 

Of  interest  to  us  is  the  system 

n-1 

x(t)  -  f(x(t))  ^  ^  u. (t)g^ (x(t) )  •  x(o)  -  xQ  e  M  (2.1) 

where  M  is  a  connected  Vm  n-dimensional  manifold, 
f  ,g1 ,. . .  ,gn_1  are  Vm  vector  fields  on  M,  and  u1,...,un_1 
are  controls. 

Let  T(M)  be  the  tangent  bundle  of  M  with  TX(M)  the 
tangent  space  for  x  €  M.  Recall  that  if  X  is  a  vector  field 
on  M  (i.e.  X  is  a  section  of  T(M) )  then  a  is  an  integral 
nerve  of  X  if  a  is  a  W*  mapping  from  a  closed  interval 
I  c T?  into  M  such  that  *  X (o( t ) )  for  all  t  6  I. 

DEFINITION  2.1  [18].  If  D  is  a  subset  of  T(M),  then  an 
ir.tecral  curve  of.  0  ie  a  mapping  a  frtn  c.  real  interval 
[t.t'j  into  M  such  that  there  exist  t * tQ  <  t^  <  . . .  < t^  *  t' 
snd  teeter  fields  X-j , . . .  .X^  in  D  uith  the  restriction  sf  a 
t:  being  or.  integral  curve  sf  X^  ,  for  each 

i  «  1 ,2, . . .  ,k  . 

DEFINITION  2.2.  let  0  be  a  subset  cf  T(M)  and  let 

£  M.  a  point  x  €  M  is  V-reaa'nable  frsx  x.  if  there  is 
an  integral  curve  a  cf  0  and  sore  T  a  o  in  the  interval  far 
a  suer,  that  a(o)  •  x  and  a(T)  «  x.  A  cutset  A  of  *•'  is 

3 -reachable  ~rc~  x_  if  e-  ;ru  coint  x  6  A  is  reachable  frc~  x„. 

— —  — —  — —  o  -  -  •  o 

Since  the  D  we  consider  is  the  subset  of  T(«)  civen  by 
t-e  vector  fields  of  a  system  of  the  form  !2.1)  we  croc  the  D 
're-  3-reachable.  We  shall  rake  assintirs  o'  f ,?1 . c„_1 
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that  assure  us  that  we  can  reach  an  open  subset  of  M  from  xQ. 

DEFINITION' .2. 3.  The  largest  open  subset  U  of  M  which  is 
reachabl e  from  xQ  is  called  t ht  region  of  reachability  from  x  ■ 
If  U  •  M,  we  say  that  the  systm  is  controllable  from  x  ■ 

DEFINITION  2.4.  Let  0  be  an  open  subset  of  M  and  let 
x  €  30  suah  that  30  it  a  manifold  near  x.  Then  f 

•points  in  the  direction  of  0  ( or  towards  0)  at  x  if  f(x) 

is  not  tangent  to  30  at  x  and  if  there  exists  an  open  neigh¬ 
borhood  H  of  x  in  M  such  that  the  vector  assigned  by  f  at 

x,  projected  into  M  (by  the  exponential  map),  and  intersected 
with  W-{x)  is  contained  in  0.  If  0  is  a  manifold  and 

if  the  above  is  true  for  every  x  €  30,  then  f  points  in  the 
direction  of  0  on  30. 

DEFINITION  2.5.  Let  0  be  an  open  set  in  M  and  let  x€  30- 
Then  f  points  in  the  direction  of  5  (or  towards  0 )  at  x 
if  there  is  cat  open  neighborhood  W  of  x  in  M  such  that  the 
integral  curve  of  f  starting  at  x  and  intersected  with  U  is 

contained  in  5.  If  this  is  true  for  all  x  €  30,  then  f 

points  in  the  direction  of  0  on  30. 

If  f  and  g  are  v”  vector  fields  defined  on  M,  we 
define  the  Lie  bracket  of  f  and  g  by  [f,g]  *  ff  f  ’  §£  9- 
A  set  of  V™  vector  fields  {f ^ ff}  is  called  involutive 


if  there  exist 


functions  y^k(x)  on  M  such  that 


(*)  *  £  Yijk^Vx^  for  all  i.j.lSl,  j 

I>>1. 


Sr,  i  f  j. 


We  introduce  one  example  from  [1]  which  may  help  us  understand 
the  problems  involved  in  trying  to  determine  the  reacnihle  set 
of  a  syste". 

EXAMPLE  Consider  the  system 


x-j ( t } T  j'o  l’j  fr. 

*  1  !  I 

'*7(t)!  |l  cj  !x, 

i  2  )  t  J  i  2 


K 

fl 

X- 

l  2; 

♦  u(t)  J 
1‘ 

fl  o'  (x^j 

I  !  [  ! 

P  -1!  iX2j 


f(x(t!)  +  uft)g(x(t) ) 
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2 

where  M  is  the. set  H  -  {(o,o)}.  Notice  that  f  and  ,g  are 
linearly  independent  on  M.  Brockett  [1]  states  that  if  * 
(x^.Xj)  is  in  the  positive  quadrant,  then  the  region  of  reach* 
ability  U  from  xfl  is  contained  in  this  quadrant.  The  inte* 
gral  curve  of  g  through  a  point  on  «  o,  Xj  >  o  is  the 
line  (o.Xj)  with  Xj  >  o.  Moreover,  the  integral  curve  of  g 
through  a  point  on  Xj  ■  o,  x-j  >  o  is  the  line  (x^,o)  with 
x1  >  o.  These  together  form  the  boundary  of  the  first  quadrant 
in  M.  The  vector  field  f  assigns  vectors  to  this  boundary 
which  point  toward  the  first  quadrant.  Thus  there  is  no  hope  of 
a  solution  of  the  system  starting  in  this  quadrant  to  leave  it. 

We  could  give  more  examples  at  this  time,  but  they  would  all 
hint  at  the  same  conclusion.  If  f  ,g^ ... .  ,gfl_1  are  linearly 
independent  in  the  system  (2.1),  the  important  items  to  check 
appear  to  be  the  integral  manifolds  of  g.,...,gn if  any 
exist,  and  the  direction  of  the  vector  field  f  on  these  inte¬ 
gral  manifolds.  We  next  examine  these  conditions  for  regions  of 
reachability  with  i?1  boundaries. 

3.  if1  BOUNDARIES.  Suppose  we  consider  the  system 
n-1 

x(t)  *  f ( x( t ) )  +  u.(t)gi(x(t>),  x(o)  «  xQ  €  M,  (3.1) 

where  M  is  a  connected  n-dimensional  manifold, 

f , ....  •  9n_i  are  linearly  independent  vector  fielcs  on 

M,  and  u . u  ^  are  controls.  It  *s  easy  to  see  that  we 

can  reach  an  open  subset  of  M  which  contains  x0  in  its  clo¬ 
sure. 

Let  u  be  tne  '•ec-'on  cf  reachability  c*  t*e  nyoersu,"ea:e  sys- 
te~  given  in  .3.1}.  Let  *  be  an  element  e*  tne  boundary  cf 
U,  and  assume  is  V'  is  some  open  neignocrnood  »  c'  x 
i r  «.  As  just  mentioned  we  need  to  consider  the  directions  of 
f  on  w  ft  3U  and  the  possibility  of  having  an  integral  manifold 

tf  c . c,  1  through  x.  Retail  that  a  d-ffere't*able  sub- 

t  S  of  -  ’S  an  irztfrsl  C*  ;1 . :r., 

*  ' S  is  the  scace  spanned  by  oi,...,cn_i  at  .v  fc-  each  y€3. 
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For  a  more  thorough  discussion  of  Integral  manifolds  we  must 
consider  the  Lie  bracket  which  we  defined  earlier.  If  g^  and 


are  different  vector  fields  on  M  then 

8gi 


r  i  *i 

^9i,9j^  “  3x  9j  ‘  3x  9 


This  may  or  may  not  give  us  a  “new"  direction  in  which  to  move 
(see  [1]),  depending  on  whether  the  collection  {g.  ,g^}  is 
involutive  or  not.  Let  L'A  be  the  smallest  Lie  algebra  gener¬ 
ated  by  taking  successive  Lie  brackets  of  the  . *9n_i 

given  in  equation  (3.1).  If  we  get  a  vector  space  of  the  same 
dimension  at  each  point  of  M,  then  L'A  is  a  vector  subbundle 
of  the  tangent  bundle  to  M. 

The  following  definition  is  essential  to  our  work.  Let  S, 

f  1 

and  S2  be  9  submanifolds  of  M  of  dimensions  k  and  n-k 

respectively. 


DEFINITION  3.1.  The  manifolds  cud  Sj  intersect  trcns- 
versellu  at  a  point  y  €  S1  n  Sj  if  end  only  if  T^ ( S 1 )  ® 

TytSj)  *  T  (H).  Rots  ©  denotes  t he  direct  sun. 

We  now  prove  a  result  under  the  assumption  that  locally  our 
open  set  has  a  boundary. 


THEOREM  3.2.  Let  0  be  an  open  set  in  M  vhicr.  is  reachable 
from  xQ  for  system  (3.1),  end  let  %  be  or.  arbitrary  point  in 
50.  Surtost  there  is  or.  ever.  neighborhood  W  of  x  in  M 

'l 

such  that  W  fl  SO  is  a  9?  reel  (n-1 )- dimensional  submanifold 
-  •  y  ~ ^  Z * Q 

is  not  the  ncion  c*  the  reachability  erc~  x_ ; 


i;  the  fiber  dimension  of  t  ’  ^  at  x  is  n, 
ii;  the  integral  curve  of  seme  5.,  1  s  i  s  n-1  is  trans¬ 
versal  to  eO  et  x, 

ill)  f  assigns  at  x  a  vector  pointing  in  the  direction 
of  the  complement  of  0. 


i)  holds  then  the  fiber  dimension  cf  L'.  at  all 

-  K 

or.r.ts  in  sere  cben  neichborhood  of  x  in  M  must  be  n  (since 


Proof . 

'.r.ts  ir 

n  is  maximal).  Thus  ;C  cannot  be  an  integrate  manifold  of 
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g^,...,g  ,  near  x  by  the  Frobenius  Theorem,  and  there  exist 
a  point  y  €  30  arbitrarily  close  to  x  and  a  g^ ,  IS’iSn-l, 
such  that  the  integral  curve  of  g,.  is  transversal  to  30  at 
y.  Hence,  i)  reduces  to  ii). 

Next  we  assume  that  ii)  is  true.  If  the  integral  curve  of 

g1 ,  chosen  arbitrarily  from  the  set  g^ . gn_.|  and  renumbered 

if  necessary,  is  transversal  to  30  at  x,  then  it  is  trans¬ 
versal  to  30  in  W  fl  30,  W  being  an  open  neighborhood  of  x 
in  M  (this  W  may  be  smaller  set  than  our  original  W).  Fol¬ 
lowing  the  integral  curves  of  g^  that  start  in  W  fl  0,  a 
reachable  set  from  xQ,  and  continuing  past  W  n  30,  we  have 
that  0  cannot  be  the  region  of  reachability  from  xQ.  We  have 
used  the  fact  that  we  may  as  well  assume  we  can  move  along  the 
Integral  curve  of  any  g^  since  u^  is  unbounded. 

If  iii)  holds  at  x,  then  it  holds  for  all  points  in  W  n  30, 
and  the  argument  given  in  ii)  with  g1  replaced  by  f  implies 
the  desired  result.  n 

It  is  interesting  to  note  that  condition  1)  does  not  depend 
on  W  n  30  being  a  V1  manifold. 

We  seek  a  minimum  number  of  necessary  conditions  that  an  open 
set  U  e  M  be  the  region  of  reachability  from  xQ. 

THEOREM  3.3.  Lai  U  ba  iha  radon  of  reachability  from  x 

’  1  ° 

of  iha  ayatam  (3.1).  Suppoaa  3U  ia  s  ¥  manifold  for  cm 
span  neighborhood  W  of  x  €  3U  in  M.  Than  W  fl  5U  is  cn 
ir.zacrzl  manifold  of  , . . . , 5n_i >  one  iha  seczcr  field  f 

zsaigns  zs  W  n  ju  saszsra  pointing  in  iha  direction  of  U. 

j'zof.  It  follows  ‘'Cm  part  ii)  of  the  preceding  theorem 
:*at  w  .1  3U  is  an  integral  manifold  of  g1 ,. . .  .Sn_-(  •  Hence 
-  *  Vj  is  act-ally  a  ¥m  submanifold  of  M.  Since 
*.c^,...,g  ^  *o rr  a  linearly  independent  set  on  M  and 

'  *U  in  an  inteoral  manifold  of  g^,...,gn_p  part  iii) 
•-t'-es  the  statement  concerning  f.  - 

i.a  jra'l  orove  in  the  next  section  tnat  the  hypothesis  SU 
is  ' C'  near  x  is  superfluous. 
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4.  THE  RESULT  FOR  V  MANIFOLDS.  The  following  theorem 
was  proved  in  [II]  for  use  in  the  uniqueness  of  analytic  contin¬ 
uation  problem  for  CR-distributions  on  CR- hypersurfaces  in 
n  2  I 

I  ,  n  >  1.  The  statement  concerning  a  V  boundary  can  be 

relaxed  to  S?1 ,  or  we  can  simply  replace  if1  by  ^  every¬ 
where  in  the  preceding  section. 

THEOREM  4.1.  Let  M  be  a  1?"  manifold,  of  dimension  n, 
and  let  H  be  a  subbundle  of  the  tangent  bundle  of  M  with 
fiber  (vector  space)  dimension  n-1 .  Suppose  U  c  M  is  an 
open  set  with  the  property  that  if  0  e  U  is  an  open  set  having 
a  <ig^  boundary ,  then  for  each  x  €  30  n  3U  we  have  7^(30)  * 
Hx  (the  fiber  of  H  at  xj.  Then  for  each  point  x  €  3U, 
there  is  a  neighborhood  V  of  x,  a  real-valued  function 
h  £  (V)  with  nonzero  differential  for  all  points  in  V,  and 

a  closed  nowhere  dense  set  E  cR  such  that 

(1)  3U  n  V  *  {x  6  V | h( x)  €  E), 

(?)  for  each  t  €  E,  S^  *  (x  €  Vfh(x)  *  1}  is  an  integral 
manifold  of  H;  i.e.  the  boundary  of  U  is  foliated  by  integral 
manifolds  of  H . 

We  now  restate  Theorem  3.3  under  more  general  conditions. 

THEOREM  4.2.  Let  U  be  the  region  of  reachability  from  x 


of  the  system  (3.1 ) . 

*•'  9l . Vl  ^ 

the  direction  of  U. 


Then  3U  is  c  V  integral  manifold 
f  assicns  vectors  cn  3U  which  taint  in 


Proof.  Let  H  be  tne  suobundle 


scanned  b> 


c, . c  , .  If  0  is  ar.  ooen  subset  of  U  with  a  V  ic.nd- 

arv,  tnen  an  apoli cation  of  Theorem  3.<  and  Theorem  4.1  gives  us 
the  stateo  conclusion.  a 

We  nave  the  following  imoortant  corollary,  the  oroof  of  which 
is  oovious. 


IQFCLLA.RY  4,3.  "setose  M  contains 


*;r  oaten  setr.  ;• 
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a)  The  closure  of  the  integral  manifold  in  M  ie  foliated 
by  integral  manifold*. 

b)  The  veetore  assigned  by  f  on  this  integral  manifold 
always  point  in  the  sane  direction  relative  to  the  integral  mani¬ 
fold  (i.e.  if  this  manifold  divides  M  into  two  components,  the 
vectors  must  point  toward  the  sane  component J . 

Then  the  system  (3.1)  is  controllable  from  any  x  6  M. 

Let  A  denote  Hausdorff  measure  (see  [3])  In  dimension  d 
on  M.  Suppose  L  is  the  set  of  points  on  which  the  Lie  algebra 
L'a  has  dimension  n-1.  Then  L  is  a  closed  set  in  M,  and 
the  Frobenius  Theorem  implies  that  L  contains  the  integral 
manifolds  of  g^,...,g  ,,  if  any  exist.  For  such  an  integral 
manifold  we  must  have  a"_1(L)  >o,  and  we  have  proved  our  next 
result. 

THEOREM  4.4.  If  An_^ ( L)  *  0  then  the  system  (3.1)  is  con¬ 
trollable  from  any  xq  £  M. 

Notice  that  if  M  is  Of  dimension  2,  we  always  have  integral 

curves  of  g  for  the  system  x(t)  «■  f(x(t))  +  u(t)g(x(t) )  ,x(0) 

*  x  .  Thus  Theorem  4.4  does  not  apply  in  this  case. 

0 

We  state  two  theorems  from  [1]  and  indicate  in  a  rather  super¬ 
ficial  way  the  relation  of  these  theorems  to  this  present  work. 

We  restrict  our  attention  to  dimension  2  and  to  a  hypersurface 
system. 

THEOREM  4.5.  Suppose  f  and  g  are  vector  fields  on  a  con¬ 
nected  9fm  real  2-dimensional  manifold  M.  Suppose  that  -'f.o} 
-set  the  conditions  of  Chew's  Theorem  for  V  vector  fields, 

~-z  suzrzse  zhzt  for-  czch  i+izicl  ssrditisn  *0  scl uzisK  cf 

x't;  *  f(x(t))  is  teriodic  with  a  least  teriod  T(x  ).  Then  the 

reachable  set  rcr-  x„  o*  the  svstex  x(t)  *  f(x(t))  + 

•  o 

utt)g(x(t))  is  the  set  given  by  Chaw's  Theorem.. 

»>e  start  at  x  €  M  and  take  the  intecrai  curve  of  o 
c 

t’rjjfh  xr.  Suotose  this  curve  divices  M  ' 'tc  two  ccnnectec 
::-;c'S'.:s  v*  and  M*.  If  the  solution  cf  m:)  ■  f(*(t)) 
sta-ts  at  x  in  the  direction  of  M+,  then  s'-ce  the  sclution 
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is  periodic,  there  is  some  point  on  the  integral  curve  of  g 
through  xQ  at  which  the  vector  of  f  is  in  the  M*  direction. 
Of  course,  this  is  in  keeping  with  Theorem  A. 2. 

Hirschorn  proved  a  very  nice  generalization  of  the  following 
result,  which  we  state  in  dimension  2. 

THEOREM  4.6  [10].  Consider  the  system 

x(t)  *  f(x(t))  +  u{t)g(x(t)) ,  x(o)  *  xQ  , 

for  a  Y?  real  2-dimensional  manifold  M.  Suppose  [f,g]  *  hg 
on  M,  uhere  h  is  a  9?  function  on  M.  Then  the  reachable 
set  fras  xg  is  obtained  by  taking  the  integral  curve  a.  of  f 
through  xg  (in  the  positive  time  sense)  and  then  all  integral 
oaves  of  g  intersecting  a. 

This  2-dimensional  version  can  be  seen  in  light  of  the  follow¬ 
ing  result  found 'in  [6].  The  one-parameter  group  of  transforma¬ 
tions  generated  by  f  permutes  the  integral  curves  of  g  with 
a  change  of  parametrization  if  [f,g]  *  hg  for  some  S?”  func¬ 
tion  h  on  M.  Interpreted  freely,  once  an  integral  curve  of  f 
passes  through  an  integral  curve  of  g  it  can  never  return. 

This  seems  to  be  in  agreement  with  Theorem  4.2. 

An  obvious  question  to  ask  is  if  the  necessary  conditions  of 
Theorem  4.2  are  also  sufficient. 

THEOREM  4.7.  Zez  xQ  E  M  and  suppose  U  is  the  smallest 

even  subset  o*  M  uiih  x.  £  0  satis  ^vinc  SU  is  or.  integral 

o  ... 

manifold  of  g, ,...,g  i  and  f  assigns  vectors  to  SU  in  the 
direction  of  U.  Then  u  is  the  region  of  reachability  frc~ 
xQ  for  the  system  (3.1). 

In  the  statement  of  tm's  theorem,  we  add  the  assumption  that 
if  U  *  M,  every  ooen  neighborhood  of  any  point  o  £  JU  con¬ 
tains  points  from  U  and  the  complement  of  0.  5.  Stefani  and 

A.  Baccioti  have  pointed  out  that  the  correct  conclusion  to  the 
theorem  as  stated  above  is  lie  region  or  reachability  c  interior 
cf  U.  Tie  author  wishes  to  thank  Professors  Stefan*  and 
Sacciot-,  for  their  comments. 
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•  •  / 

Proof.  We  know  that  we  can  reach  an  open  set  and  by  the 

theory  developed  in  this  paper  we  have  that  we  can  reach  u.  The 

important  fact  to  remember  is  that  to  leave  U  we  must  break 

through  3U  near  some  point  x  e  3U.  In  the  system  x(t)  ■ 
n-1 

f(x(t))  +  £  u.(t)g>(x(t))  at  the  point  x  we  can  move  in  the 
i*l  1  1  n-1 

directions  f  ,g1 .  ,gn_1 ,  -g1 . ,-gn+1  and  f  +  £  u<(t)gi 

for  the  appropriate  finite  u^'s.  Since  3U  is  an  integral 
manifold  of  g.,.-..,g  Lie  brackets  like  [g^.g,.]  with 
i  f  j  will  give  us  no  "new"  directions  in  which  to  move  from  x. 

Also,  since  f,g1 . gfl_1  span  T(M),  the  brackets  [f.g^], 

i  ■  n-1  will  yield  vector  fields  which  are  linear  combina¬ 
tions  of  f,g1 . gn_j  (the  same  is  also  true  for  successive 

Lie  brackets).  The  only  linear  combinations  here  which  can  be 
used  at  x  are  those  already  indicated  by  the  system.  a 

The  proof  of  Theorem  4.7  is  applicable  only  for  hypersurface 
systems  (or  certain  general  systems  that  behave  like  hypersurface 
systems).  We  prove  results  like  those  in  this  paper  for  general 
systems  of  the  form 

x(t)  ■  f (x(t) )  +  52u.(t)g<(x(t)) 

1«1  1  1 

in  [12]  and  [13].  Such  a  system  with  m<n-l  is  more  difficult 
to  handle  than  a  hypersurface  system,  and  we  make  stronger 
assunotions  in  order  to  prove  theorems  concerning  controllabil¬ 
ity. 


■ .  real  •ANALYTIC  MANIFOLDS.  Wc  6Xdfmne  the  systern 
n-1 

x(t)  *  *(x(t))  ♦  £  u. (t)g.(x(t)) ,  x(o)  «  x  €  M  (5.1 ; 
i«l  1  1  0 


where  M  is  a  connected  real -analytic  n-dimensional  manifold 
f .c^ ,. . .  ,gn_.  are  real -analytic  complete  vector  fields  on 
ar.d  u,  ,...,u  ,  are  controls.  We  take  the  Lie  aclebras  L„ 

l  “  -  I  « 

and  L‘,  as  defined  orevicusly.  If  the  vector  soace  dimension 
cf  La  at  is  less  than  n,  we  aoply  the  real  analytic 
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version  of  Chow's  Theorem  and  refer  the  reader  to  [13].  We  prove 
the  following  result,  which  is  an  improvement  of  Theorem  4.7  for 
the  real -analytic  case. 

THEOREM  5.1.  Asstme  the  vector  space  dimension  of  at  xQ 

is  n  and  that  f,g^ . 9n_1  are  linearly  independent  at  some 

point  of  M.  Let  U  be  the  smallest  open  subset  of  M  with 
xq  €  0  satisfying  3U  contains  the  integral  manifolds  of  L ‘  A 
which  intersect  it  (and  which  are  given  by  Chaw 's  Theorem)  and 
f  points  in  the  direction  of  0  on  3U  .  Then  U  is  the 
region  of  reachability  from  for  the  system  (5.1). 

Proof.  Because  M,  f ,g^ ,. . .  ,9n_'j  are  real  analytic, 
f^  .g-j ,. . .  ,9n_1  are  linearly  independent  at  some  point  of  M, 
and  M  is  connected,  the  set  of  points  P  in  M  at  which 
f.g. >• • • >9n_i  are  linearly  dependent  is  nowhere  dense  in  M. 

Of  course  this  set  contains  the  points  where  the  dimension  of 
is  less  than  n.  Since  the  dimension  of  lA  at  xg  is  n, 
this  is  true  for  an  open  neighborhood  of  xQ  in  M.  From 
Krener’s  [15]  proof  we  can  reach  an  open  set  0  in  M  which  is 
arbitrarily  close  to  xg.  By  remarks  made  earlier  in  this  proof, 
we  may  as  well  assume  that  f,g^ ,. . . ,gn.^  are  linearly  indepen¬ 
dent  on  0.  Let  O'  be  the  largest  connected  component  of  M 
containing  0  on  which  f ,g^ ,. . . ,9n_i  are  linearly  independent. 
Choose  a  point  x-j  6  0  and  apply  Theorem  4.7  with  x  replaced 
by  x1  and  M  by  O'  to  get  the  region  of  reachability  U'  of 
tne  syste-.  eroz  x^  in  O'. 

We  examine  the  boundary  of  U'  in  M,  which  -.as  two  compo¬ 
nents  31"  n  P  ana  3U'  n  P,  where  P  denotes  tne  complement 
of  P  in  K.  If  3U'  IP  is  a  nonempty  set.  Theorem  4.7 
implies  that  3U'  (IP  is  a  (n-l)  dimensional  integral  mani'pld 

oe  o, . o  .  and  f  points  towards  U'  on  3U'  n  P. 

' !  *  n- 1 

We  assume  that  3'J'  n  P  is  nonempty  and  that  3U'  n  P  is  a- 
reel-anal tyic  (n-l)  dimensional  manifold  in  an  open  neighborhood 
of  a  point  x  £  3U'  n  P.  If  the  integral  curve  of  seme 
s, ,1  s  i  <  n-l ,  is  transversal  to  3U'  n  P  at  x  or  -f  f 
assigns  at  x  a  vector  which  does  rot  point  towards  O', 
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arguments  like  those  given  in  the  proof  of  Theorem  3.2  show  that 
we  can  reach  an  open  set  in  another  connected  component  of  M 
like  O',  and  we  simply  start  all  over  there. 

Thus  we  assume  that  the  (n-l)-dimensional  manifold  parts  of 
3U'  n  P  contain  the  integral  manifolds  of  l'A  which  intersect 
them  and  f  points  towards  0’  on  these  parts.  Let  xesu'n  P 
and  suppose  the  unique  integral  manifold  N  of  L1^  through  x 
contains  the  integral  curves  of  f  that  start  at  every  point  of 
N.  .In  this  case  we  have  the  vector  space  dimensions  of  and 
l'A  agree  on  N  and  if  we  reach  any  point  of  N,  then  we  can 
reach  all  points  of  N  (this  is  the  important  part  of  the  real- 
analytic  theory  in  [13]).  Then  N  will  not  be  in  U  if  3N 
contains  the  integral  manifolds  of  L'A  that  intersect  it  and 
f  points  in  the  direction  of  N  on  3N.  Of  course  this  means 
that  if  0  and  3N  intersect,  then  the  points  in  this  intersec¬ 
tion  are  in  3U  and  satisfy  the  required  conditions  on  the  inte¬ 
gral  manifolds  of  l'A  and  on  the  direction  of  f.  An  easy 
example  of  such  a  manifold  N  is  a  common  equilibrium  point  of 
f ,g1 , . . . , gn_1 ,  which  is  certainly  not  reachable. 

Hence  we  have  that  3U‘  n  P  contains  the  integral  manifolds 
of  l'A  (given  by  Chow's  Theorem)  which  intersect  it  and  f 
points  in  the  direction  of  0’  on  3U'  n  P.  Therefore  3U' 
must  consist  of  the  integral  manifolds  of  L' .  intersecting  it 
ano  f  ooints  towards  U'  on  3U' .  A  repeat  of  the  proof  of 
"Theorem  A. 3  found  in  [12]  shows  that  we  cannot  reacn  an  ooen  sub¬ 
set  of  v.  in  l"  from  x,  . 

recall  there  is  an  coen  neighborhood  of  x  in  w  on  wnich 

tne  dimension  of  L.  at  x  is  n.  Arbitrarily  c’cse  to  anv 
u.c 

point  in  this  neighborhood  that  can  be  reached  from  x^  is  an 

open  set  which,  's  reachable  from  x„.  Thus  we  can  -each  an  open 

o 

V  of  y  which  contains  x  in  its  closure,  and  we  ~av  assume 

o 

that  3V  contains  the  integral  manifclos  of  L  .  intersecting 
it  and  f  points  towards  V  on  ?v.  We  reed  to  s-cw  that 

V  *  'J,  and  we  r.now  that  x  i  v  r.  ]. 

0 

If  >r  e  V  n  U,  ‘.her  an  open  subset  cf  U  car  :e  'eacred. 
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8y  the  theory  developed  in  this  paper  u  is  reachable  and  no 
larger  open  subset  containing  II  of  K  is  reachable.  Also  V 
can  be  reached  and  we  cannot  leave  V  once  we  get  to  it.  Hence 
we  must  have  V  »  u.  If  xfl  £  3V  n  U,  we  reach  an  open  subset  i 

of  M  arbitrarily  close  to  xQ  (this  is  how  we  found  V)  which 
must  be  in  U.  The  same  proof  just  mentioned  implies  V  *  U. 

Suppose  that  xQ  £  3U.  Then  the  integral  manifold  of  L'A 
through  xQ  must  remain  in  3U  and  the  integral  curve  of  f 
starting  at  xQ  (and  moving  in  positive  time)  moves  in  0.  If 
the  curve  for  f  reaches  U,  then  we  know  we  can  reach  an  open 
subset  of  U,  which  will  also  be  an  open  subset  of  V  by  the 
way  in  which  V  was  formed.  Above  arguments  show  that  we  have 
V  m  u. 

Assume  that  the  integral  curve  of  f  starting  at  xQ  stays 
in  3U.  Taking  the  unique  integral  manifold  of  L'A  through 
each  point  of  this  integral  curve  keeps  us  in  3U.  Let  N'  be 
the  set  defined  as  the  union  of  these  integral  manifolds  of  L'A- 
At  each  point  of  N‘  we  can  start  an  integral  curve  of  f,  and 
we  suppose  that  all  such  curves  remain  in  3U.  We  can  continue 
to  repeat  this  process  and  we  assume  that  we  cannot  leave  3U. 
Perhaps  Lie  brackets  like  [f,g]  and  higher  order  brackets  will 
allow  us  to  get  out  of  3U.  Helgason  [5]  interprets  the  Lie 
bracket  [f,g]  at  a  point  x  as  the  tangent  vector  to  a  curve 
segment  (starting  at  x)  and  moving  in  the  g  direction,  the 
f  direction,  the  -g  direction,  and  then  the  -f  direction 
all  for  t  units  of  time.  Thus  Lie  brackets  will  not  help  us 
reach  ar  open  subset  c-f  M.  Since  we  xnow  that  we  can  ’•each  the 
open  set  V,  an  integral  curve  of  f  rust  take  us  into  U. 

Once  we  nave  tnis,  we  know  that  V  *  L’.  z 
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MULTI-INPUT  NONLINEAR  SYSTEMS 


L.  R.  Hunt  and  Renjeng  Su 

Abstract 

Consider  the  nonlinear  system 

x(t)  =  f(x(t))  +  J  ui (t)gi (x(t ) )  , 

where  f,g^,  .  .  • »  9m  are  C*  vector  fields  on  some  neighborhood  of  the 
origin  in  Rn  and  f(0)  =  0.  We  present  necessary  and  sufficient  conditions 
for  this  system  to  be  transformed  to  a  controllable  linear  system.  Our 
results  are  constructive  and  depend  upon  the  solutions  of  overdetermined 
systems  of  partial  differential  equations. 
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Global  Transformations  of  Nonlinear  Systems 
L.  R.  HUNT  AND  RESJZNG  SU 

Abstract  —  Recent  results  have  established  necessary  and  sufficient 
conditions  for  a  nonlinear  system  of  the  form 

*ft)  ■  f(x(t))  +  uCt)g(x(t)) 

with  f(0)  *  0,  to  be  locally  equivalent  in  a  neighborhood  of  the  cr_gin 
in  R3,  to  a  controllable  1 iaear  system.  We  combine  these  results  with 
several  versions  of  the  global -inverse  function  theorem  to  prove  suffi¬ 
cient  conditions  for  the  transformation  of  a  nonlinear  system  to  a  linear 
system.  In  doing  sc  we  'introduce  a  technique  for  constructing  a  trans¬ 
formation  tinder  the  assumptions  that  {g,[f,g],  ...»  (adn~1f,g)}  span 

an  n-dimansional  space  and  that  {g.[f»g]-»^- - (adn~Z£  ,g) }  is  an 

invoice ive  set. 
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CONTROL  OF  NONLINEAR  TIME-VARYING  SYSTEMS 


L.  R.  Hunt  and  Renjeng  Su 

Abstract 

Consider  the  time-varying  nonlinear  system  of  the  form 

m 

;(t)  =  f(x,t)  +  l  u-(t)g.(x,t), 
x  i=l  1  1 

with  f.g-j,...,  gm  being  C*  vector  fields  on  Fn+"'.  We  give  necessary 
and  sufficient  conditions  for  this  system  to  be  transformable  to  a  time- 
invariant  controllable  linear  system.  In  order  to  control  the  nonlinear 
system,  we  map  to  the  linear  system,  choose  a  desired  control  there,  and 
return  to  the  nonlinear  system  by  the  inverse  of  the  transformation. 
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TRANSFORMATION  OF  NCNHOMOGENEOUS  NONLINEAR  SYSTEMS 
R.  Su,  G.  Meyer,  and  L.  R.  Hunt 


Abstract 

The  problem  of  when  a  nonlinear  system  can  be  transformed  to  a 
linear  system  is  treated  here.  Previous  results  are  further 
generalized. 
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ABSTRACT 

Sufficient  Conditions  for-  Controllability 
L.  R.  Hunt  " 


The  problem  is  to  find  sufficient  conditions  for  the  system 

i 

n 

x(t)  ~  £  (x  ( t) )  +  [  u.  ( t) g  .  (x (t) )  ,  x(0)  =  x  c  M 

i=l  11  u 

C3 

to  be  controllable.  Here  M  is  a  connected  C-  n-dimensionai  mani- 

fold,  f,  g,,...,q  are  complete  C-  vector  fields  on  M,  and 
l  n> 

u,  , are  real- valued  controls.  If  t  *  n-1,  M,  f,  g,  ,..., 

••  *-.-*■  m _  .  ..  l 

are  real-analytic,  M  is  simply  connected,  and  g^, . . . ,g  ^ 
are  linearly  independent  on  M,  then  necessary  and  sufficient, con¬ 
ditions  are  known.  For  the  case  of  our.  £•"  system  with  general  m, 
we  assume  that  the  space  spanned  by  the  Lie  algebra  generated 

bv  f,  c, ,...,c  and  successive  Lie  brackets  has  constant  dimen- 

*  j.  'm 

sion  ?  on  M  and  the  algebra  L’  generated  by  g. ,...,g  and  succes- 


'A  ’  '1‘  am 

sivs  lie  brackets  has  constant  dimension  t1  <  p  cr.  M. 


—  -C  —  i  — .  ^ 


Chew's  Theorem  implies  controllability  for  a  p-dimensicnal  sub¬ 


manifold  cf  containing  >•  . 


<  o,  sue:  icie.no  cor.oirions 


are  feu n.d  involving  the  computation  cf  certain  _ie  rrac.-;ets  at 
points  where  the  vector  field  f  is  tangent  to  the  integral  mani¬ 
folds  cf  Li .  Core  we  assume  that  every  in to oral  manifold  cf  Li 
•  c.  r. . i i r. s  such  c  oc i r. c . 
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ABSTRACT 


n- Dimensional  Controllability  with  (n-1)  Controls 

L.R.  Hunt 

Let  M  be  a  connected  real-analytic  n-dimension'al  manifold, 
f ,g^, . . . ?gn_2  be  complete  real-analytic  vector  fields  on  M  which 

are  linearly  independent  at  some  point  of  M,  and 

be  real-valued  controls.  Consider  the  controllability  of  the 
system 

n-1 

x(t)  =  f (x ( t) )  +  £  u. (t) g. (x(t) ) ,  x(0)  =  xneM. 

i=l  1  1  u 

Necessary  and  sufficent  conditions  are  given  so  that  this  sys¬ 
tem  is  controllable  on  any  simply  connected  domain  D  contained 
in  M  on  which  g1, . . .  ,.gn_1  are  linearly  independent.  These 
conditions  depend  on  the  computation  of  Lie  brackets  at  those 
points  where  f  ,g^,  .  . .  /9n_]_  are  linearly  dependent. 
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6.  Summary: 

A  decade  ago  the  author  initiated  a  research  program  directed  at  the 
formulation  of  an  algorithm  for  f^ajult  dlagnoAli  In  analog  clxculti  and  t>yt>t<mt> 
which  was  capable  of  running  evidently  in  the  dual  mode  "depot/ filild" 
invlxo nmint  associated  with  most  DOD  maintenance  systems.  Specifically,  it 
was  desired  to  formulate  an  algorithm  which: 

i).  is  applicable  to  both  Union,  and  nonllnian.  Ayttim  modeled  in 
either  the  turn  on.  ^xiquinc y  domain, 

ii).  can  be  used  to  locate  multiple  hold  on.  tofat  (,aultA, 

iii).  and  is  capable  of  locating  failures  in  " xeplaciable  moduli *" 
such  as  an  IC  chip,  PC  boand,  or  *ub*y*tem  rather  than  dis- 
discrete  components; 

all  of  this  being  achieved  with  a  minimum  number  of  tut  points  and  at  accept¬ 
able  computational  cat.  Although  a  number  of  algorithms  which  achieve  these 
goals  in  the  linear  case  have  been  proposed  by  the  author  and  others,  little 
progress  had  been  made  in  the  nonlinear  case  until  the  past  year.  During  the 
past  year,  however,  we  have  found  a  long  souqht  mechanism  for  incorporating  a 
gallon  bound  into  a  < ilmulotlcn-afaix-tat  algonltlvm  thereby  combining  the 
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best  attributes  of  the  classical  simulation-before-test  and  simulation- 
after-test  algorithms  into  a  single  package.  Indeed,  the  success  of  our 
simulated  experiments  with  the  new  algorithm  has  been  phenomenal;  and,  as 
such,  we  believe  that  oua  new  algoAlthm  *.epAe&ent&  an  ei^entiaUiy  complete 
< solution  to  the  analog  faault  cUagnotlA  pAoblem. 
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Fault  Diagnosis  in  Electronic  Circuits 


\ 

R.  Saeks*  and  R.  Liu** 


During  the  past  quarter  century  the  engineering 
community  has  been  witness  to  tremendous  strides 
in  the  art  ol  electronics  design.  The  graphical 
algorithms  of  the  previous  generation  have  given 
way  to  the  modern  CAD  package,  the  breadboard 
has  been  subsumed  by  the  simulator.  Indeed,  even 
the  universal  building  block  has  become  a  reality. 
To  the  contrary  electronic*  maintenance  has  changed 
little  since  the  day  of  the  vacuum  tube,  remaining 
the  responsibility  of  the  experienced  technician 
with  scope  and  multimeter.  As  such,  our  ability 
to  design  a  complex  electronic  circuit  is  quickly 
out-distancing  our  ability  to  maintain  it.  In  turn, 
the  price  reductions  which  have  accompanied 
modem  electronics  technology  have  been  paralleled 
by  increasing  maintenance  and  operations  costs. 
Indeed,  many  industries  are  finding  that  the  life 
cycle  maintenance  costs  for  their  electronic  equip¬ 
ment  now  exceed  their  original  capitol  investment. 

Given  the  above,  it  is  quickly  becoming  apparent 
that  the  electronics  maintenance  process,  like  the 
design  process,  must  be  automated.  Unfortunately, 
the  30  years  of  progress  in  circuit  theory,  on 
which  our  electronics  design  automation  has  been 
predicated,  does  not  exist  in  the  maintenance 
area.  As  such,  the  put  decade  hu  witnessed  the 
inauguration  of  a  basic  research  program  to  lay  the 
foundation*  for  a  theory  of  electronics  maintenance 
and  a  parallel  effort  to  develop  operational  elec¬ 
tronic  maintenance  codu. 

Thus  far  the  greatest  succeu  hu  been  achieved 
in  the  digital  electronics  area,  wherein  the  finite 
state  nature  of  the  UUT  (unit  under  tut)  may  be 
exploited9’.  Typically,  one  assumu  that  all  fail- 

*  Texas  Tech  University 

••  University  of  Notre  Dame 


ures  manifest  themselves  in  the  form  of  com¬ 
ponent  outputs  which  are  either  “stuck-at-one”  or 
“stuck-at-zero"  and/or  shorts  and  opens9’.  Under 
such  an  assumption  a  theory  for  digital  system 
maintenance  hu  been  developed  and  practical 
fault  diagnosis  algorithms  are  in  the  formative 
stages  of  development.  Typically,  one  hypothesises 
some  limit  on  the  number  of  simultaneous  faults 
and  then  simulates  the  responses  of  the  UUT  to  a 
family  of  tut  vectors  for  each  allowed  combination 
of  fault*.  The  actual  responses  of  the  UUT  are  then 
compared  with  the  simulated  responses  to  locate 
the  failure.  Although  lacking  in  uthetic  appeal 
the  above  approach,  termed  fault  simulation,  is 
ideally  suited  for  the  maintenance  environment, 
wherein,  the  actual  simulation  process  need  only  be 
done  once  at  the  factory  or  a  maintenance  depot 
with  the  simulated  response  data  being  distributed 
via  magnetic  tape  to  the  various  field  locations 
where  the  actual  tut  actual  tut  is  conducted.  As 
such,  with  the  aid  of  some  sophisticated  software 
engineering,  this  apparently  “brute  force”  approach 
to  the  fault  diagnosis  problem  has  slowly  evolved 
into  a  workable  concept4’.  Indeed,  at  the  present 
time  a  number  of  automatic  tut  program  gener¬ 
ators  which  classify  faults,  choose  tut  vectors,  and 
carry  out  the  appropriate  simulation  (often  in  a 
parallel  processing  mode),  are  commercially  availa¬ 
ble  and,  as  such,  the  automated  maintenance  of 
digital  electronic  circuits  is  becoming  a  reality4’. 

Unfortunately,  the  above  described  succeu  in  the 
digital  world  hu  not  been  paralleled  by  progreu 
in  the  analog  world.  Indeed,  tut  engineers  com¬ 
plain  that  while  80%  of  the  boards  are  digital,  80% 
of  their  (headaches  are  analog  and  hybrid.  This 
difficulty  arisu  from  a  number  of  characteristics 
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of  the  analog  problem  which  are  not  encountered 
in  digital  circuits.  Indeed,  in  an  analog  circuit: 

(  i  )  there  is  a  continuum  of  possible  f.  lures, 

(  jj )  a  component  may  be  “in  tolerance”  but  not 
nominal, 

(i)  complex  feedback  structures  are  encoun¬ 
tered. 

(hr)  simulation  is  slow  and  costly. 

( y )  post-fault  component  characteristics  may 
not  be  known, 

(vi)  and  a  fault  in  one  component  may  induce 
an  apparent  fault  in  another 
Items  (  i )  and  ( j )  imply  that  an  extremely  large 
number  a i  simulations  will  be  required  for  analog 
testing.  Items  (i)  and  (iv)  suggest  that  these 
simulations  will  be  far  more  expensive  than  similar 
digital  simulations.  Finally,  items  '  y )  and  (vi) 
indicate  that  the  simulation  of  a  post-fault  circuit 
by  itself  may  not  be  s  tractable  problem.  As 
such,  it  is  by  no  means  clear  that  the  kind  of 
“brute  force”  fault  simulation  algorithm  associated 
with  the  digital  problem  will  be  applicable  to  the 
analog  or  hybrid  case. 

As  an  alternative  to  fault  simulation,  a  number 
of  academic  researchers  have  proposed  a  variety  of 
“post  test”  fault  diagnosis  algorithms,  wherein,  an 
“equation  solving  like”  algorithm  is  used  to  locate 
the  faulty  component  given  the  test  data  from  U 
Although  these  algorithms  are,  in  some 
sense,  “smarter”  than  the  simulation  algorithms, 
most  of  the  required  computing  must  be  done  in 
the  field  after  the  UUT  has  been  tested.  More¬ 
over,  these  computational  requirements  must  be 
replicated  each  time  a  unit  fails.  As  such,  the 
success  of  such  "post  test"  algorithms  is  contingent 
on  reducing  their  computational  requirements  to 
a  bare  minimum.  Although  no  system  is  yet 
operational,  with  the  aid  of  the  powerful  linear 
circuit  theory  developed  over  the  past  half  century, 
a  computationally  efficient  solution  to  the  fault 
diagnosis  problem  for  linear  analog  circuits  appears 
to  be  within  reach1’1'.  Unfortunately,  no  such 
light  exists  at  the  end  of  the  nonlinear  tunnel, 
wherein  progress  appears  to  be  limited  by  a 
"computational  complexity/test  point”  bound. 

Not  suprisingly,  the  computational  cost  of  an 


analog  fault  diagnosis  algorithm  is  an  inverse  func¬ 
tion  of  the  number  of  test  points  at  which  mea¬ 
surements  of  the  UUT  may  be  made.  Indeed,  if 
one  lets  n  be  a  measure  of  UUT  complexity  (which 
may  loosely  be  taken  to  be  the  total  number  of 
terminals  for  all  of  the  circuit  components),  then 
if  one  has  access  to  0(n)C(tt)  test  points  the  fault 
diagnosia  problem  can  be  resolved  .  using  linear 
algorithms”'1'’.  Moreover,  by  combining  such  algo¬ 
rithms  with  the  above  mentioned  linear  algorithms, 
acceptable  computational  efficiency  can  be  obtained 
with  0(m)  test  points  where  m  is  a  measure  of  the 
complexity  of  the  “nonlinear  subsystem”  of  the 
UUT*’'”.  Although  such  algorithms  can  be  effec¬ 
tive  on  the  typical  academic  example  a  “real  world” 
PC  (printed  circuit)  board  does  not  have  terminal 
space  for  the  20  or  30  test  points  which  are 
required  even  for  a  routine  board  made  up  of 
discrete  components  and/or  SSI  (Small  Scale  Inte¬ 
gration).  Although  the  problem  can  be  partially 
alleviated  by  making  internal  measurements  with 
the  aid  of  a  “bed-of-nails”  tester  it  has  been  our 
experience  that  such  testers  cause  as  many  failures 
as  they  locate  while  their  applicability  to  two- 
sided,  multilayer,  and  coated  boards  is  severely 
limited.  As  such,  we  would  like  to  limit  the 
number  of  test  points  to  the  terminal  space 
available  at  the  edge  of  a  PC  board.  On  the  other 
hand,  the  UUT  complexity,  n,  increases  with  the 
area  of  the  board.  As  such,  the  number  of  test 
points  required  by  an  analog  fault  diagnosis  algo¬ 
rithm  should  increase  et  a  rate  of  no  greater  than 
Ofn1'1).  A  further  study  of  the  paetible  tradeoff 
between  test  points  and  computational  cost  appear* 
in  references  11)  and  12). 

Unfortunately,  ail  computationally  acceptable 
"post  test”  algorithms  which  have  thus  far  been 
propoeed  have  teat  point  requirements  which  grow 
linearly  with  UUT  complexity  (assuming  that  m 
grows  linearly  with  n).  As  such,  many  researcher* 
are  looking  at  the  classical  fault  simulation  algori¬ 
thms  with  renewed  vigor.  Indeed,  theee  algorithms 
have  minimal  on-line  computational  costs,  while 
the  number  of  test  points  employed,  can  easily  be 

(it  1)  f(n)-0(n)  means  f  increases  in  the  order  of 
n,  more  precisely,  lf(n)|£c|n|  for  some  c>0. 
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kept  below  0(n2fl).  The  difficulty  lie*  with  the 
required  number  of  simulation*  end  the  develop 
meat  of  deciiion  algorithm*  which  will  allow  us  to 
“interpolate”  between  simulated  data  points. 

Thus,  while  the  state-of-the-art  in  digital  diaga- 
noais  is  fast  maturing,  a  serious  investigation  of 
analog  fault  diagnosis  problem*  is  only  just  begin¬ 
ning.  Indeed,  a  satisfactory  fault  diagnosis  code  for 
linear  analog  circuits  has  yet  to  be  demonstrated 
while  the  nonlinear  problem  has  yet  to  prograa 
beyond  the  basic  research  stage. 
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INTRODUCTION 

after  a  half  century  of  neglect  by  the  electronics 
tLSimnnl  *y  the  past  decade  has  witnessed  an  expand¬ 
ing  effort  in  the  analog  fault  diagnosis  area, 
indeed,  the  ever  increasing  complexity  of  elec¬ 
tronic  circuits  coefeined  with  the  decreasing  avail¬ 
ability  of  trained  saint enance  technicians  has 
pn«t.ii  c omputea-aided  tutinq  (CAT)  to  the  fore¬ 
front  of  electronics  research.  Unfortunately,  the 
tremendous  strides  which  have  been  made  in  digital 
test  technology  have  not  been  paralleled  by  equal 
progress  in  the  analog  area.  As  such,  even  though 
■80%  of  the  boards  are  digital  80%  of  the  problems 
are  analog*. 

the  lack  of  progress  in  analog  CAT  vis-a-vis 
digital  CAT  nay  be  attributed  to  four  factors  i 

i).  the  cose  of  analog  circuit  simulation, 

ill .  the  continuous  nature  of  analog  failure 
phenomena, 

lii).  tolerances  on  the  'good*  components  in  an 
analog  circuit, 

iv).  and  the  lack  of  viable  models  for  the 
components  in  a  faulty  circuit. 

Moreover,  these  difficulties  have  been  axagerated 
by  the  economics  of  the  maintenance  environment 
which  limits  the  degree  to  which  many  of  the 
classical  tools  of  analog  circuit  design  can  be 
used  in  a  CAT  package . 

The  purpose  of  the  present  paper  is  to  describe  a 
set  of  criteria  which  we  believe  a  practical 
analog  CAT  algorithm  should  achieve  and  to  indi¬ 
cate  the  degree  to  which  they  are  met  by  the  var¬ 
ious  algorithms  which  have  thus  far  been  proposed.1 
These  criteria  include  computational  requirements, 
numbers  of  test  points  and  test  vectors  employed, 
robustness  to  tolerance  effects,  availability  of 
aodela,  and  the  degree  to  which  the  algorithm  is 
amenable  to  parallel  processing.  Although  many 
specific  algorithms  have  been  proposed  they  may 
naturally  be  classified  into  three  categories! 

i).  simulation-before-test, 

ii) .  simulation-after-test  with  a  single  test 
vector, 

iii) .  and  simulation-after-test  with  multiple 
test  vectors. 

Each  of  these  three  approaches  to  the  analog  CAT 
problem  is  compared  against  our  criteria,  and, 
interestingly,  each  approach  fails  to  meet  at 
least  one  of  the  proposed  criteria. 

criteria 

*•  Computational  Requirements!  Unlike  a  CAD  al¬ 


gorithm  wh.ch  is  used  only  in  the  initial  design 
of  a  circuit  or  system,  a  CAT  algorithm  lives  in 
an  operational  environment  and  thus  must  be  used 
repeatedly  each  time  a  system  fails.  As  such,  a 
viable  measure  for  the  computational  cost  of  a  CAT 
algorithm  must  distinguish  between  on-line  computa¬ 
tion  which  is  done  in  the  field  and  must  be  repeat¬ 
ed  for  each  unit  under  test  (UUT)  and  off-line  com¬ 
putation  which  is  independent  of  the  unit  under 
test  and  thus  need  only  be  done  once  at  the  factory 
or  a  maintenance  depot.  Indeed,  the  distinction 
between  on-line  and  off-line  computation  is  further 
exaggerated  by  the  high  cost  of  computing  and  the 
dearth  of  trained  personnel  in  a  field  maintenance 
environment  vis-a-vis  that  is  available  at  a 
maintenance  depot.  Thus  in  a  CAT  algorithm  a  gut at 
pnionity  must  be  placed  on  inducing  the  on-line 
computational  wquixementi  even  at  the  cost  of 
significantly  increasing  the  off-line  computation. 
As  such,  an  algorithm  which  is  viable  in  a  design 
environment  might  not  be  acceptable  in  a  mainten¬ 
ance  environment  and  vice-versa.  Indeed,  in  a  CAT 
algorithm  one  would  be  happy  to  accept  the  cost  of 
generating  a  complex  data  base  in  an  off-line  en¬ 
vironment  to  achieve  a  reduction  in  on-line  compu¬ 
tational  requirements. 

B.  Test  Points!  Historically,  analog  circuits  have 
been  tested  with  the  aid  of  a  *bed  of  nails*  tester 
which  allows  one  to  make  use  of  test  data  which  is 
not  accessible  via  the  input  and  output  terminals 
of  the  circuit  board.  Unfortunately,  modem  cir¬ 
cuit  boards  are  often  multilayered  and/or  coated, 
thereby  limiting  the  applicability  of  the  "bed  of 
nails*  concept.  As  such,  a  modern  CAT  algorithm 
must  be  designed  to  work  with  the  test  data  which 
is  available  at  the  externally  accessible  terminals 
of  a  printed  circuit  board.  In  practice,  this 
proves  to  be  a  dominating  factor  in  the  design  of 

a  CAT  package,  which  precludes  the  use  of  some  of 
the  more  attractive  algorithms  with  test  point  re¬ 
quirements  which  grow  linearly  with  circuit  com¬ 
plexity.  In  fact,  circuit  complexity  is  propor¬ 
tional  to  the  area  of  a  printed  circuit  board  (if 
not  a  power  thereof)  while  the  number  of  accessible 
test  points  is  proportional  to  the  edge  length  of 
the  board.  As  such,  in  a  practical  CAT  package  it 
is  reasonable  to  require  that  the  numbe/i  o<  tut 
points  gww  with  the  iquane  wot  oj  circuit  com¬ 
plexity  (or  less). 

C.  Robustness  t  Unlike  a  digital  system  wherein  a 
device  is  either  good  or  bad  in  an  analog  circuit  a 
device  is  either  ”in-tolerance”  or  'out-of-toler¬ 
ance*  and,  as  such,  an  analog  CAT  algorithm  must  be 
able  to  cope  with  the  effects  of  components  which 
one  in-tolenance  but  not  nominal.  Although,  at  the 
time  of  this  writing,  there  is  insufficient  experi¬ 
mental  data  to  determine  the  import  of  robustness 
in  an  analog  algorithm  it  is,  at  minimum,  a  factor 
of  which  one  must  be  cognizent  and  may,  in  fact, 
prove  to  be  a  dominating  factor  in  the  design  of  a 
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viable  CAT  package. 


0.  Medelsi  Since  stost  CAT  algor ithma  presuppose 
some  Cora  of  circuit  sisiulation  in  their  operation 
and  design  of  such  an  algorithm  siust  consider  the 
type  and  availability  of  circuit  models  which  are 
required  and/or  available.  In  particular,  does 
the  algorithm  use  nominal  circuit  models  or 
{auitld  circuit  modtU?  Indeed,  even  if  nominal 
circuit  models  are  used  do  they  operate  in  their 
normal  range?  Finally,  one  must  consider  whether 
or  not  the  rlgorithm  is  capable  of  dealing  with 
"fussy*  components  which  do  not  admit  viable  simu¬ 
lation  models. 

Z.  Module  vs.  Parameter  Testing:  Most  analog 
fault  diagnosis  algorithsis  can  be  categorised  as 
either  module  oriented  or  parameter  oriented.  Zn 
the  former  case  the  algorithm  tests  the  input- 
output  performance  of  the  individual  nodules  or 
subsystems  which  make  up  the  OUT  while  in  the 
latter  case  the  algorithm  estimates  a  set  of  para¬ 
meter  values  which  determine  the  performance  of  a 
given  circuit  component.  Although  one  can  often 
formulate  a  circuit  model  for  a  given  module  there¬ 
by  permitting  one  to  use  a  parameter  oriented  al¬ 
gorithm  to  test  modules,  such  a  process  may 
unnecessarily  complicate  the  test  procedure.  As 
such,  a  moduli  onientid  CAT  atgonidhm  ik  pn.iitAA.td 
over  a  parameter  oriented  algorithm  if  it  can  be 
formulated  without  compromising  other  factors. 

F.  In-Sltue  Testing)  Although  secondary  to  the 
above  considerations  the  ideal  CAT  algorithm 
should  allow  for  in-kitut  ten  tying.  Since  one 
cannot  control  the  input  signals  applied  to  the 
OUT  in-situe  such  an  algorithm  must  work  with  an 
arbitrary  input  signal  rather  than  a  fixed  set  of 
test  vectors. 

S.  Parallel  Processing;  Since  the  CAT  problem  is 
inherently  a  large  scale  systems  problem  it  is 
essential  to  exploit  whatever  computational  power 
is  available  to  reduce  both  on-line  and  off-line 
computational  requirements.  In  particular,  digital 
CAT  algorithms  often  use  some  degree  of  parallel 
processing  in  their  implementation.  Given  the 
additional  computational  problems  associated  with 
an  analog  CAT  algorithm  thi  dtgnii  to  which  an  aJL- 
gonithm  can  be  implimtnttd  in  poAailtJL  become!  a 
kignifacant  facto  n.  in  determining  its  viability 
and  should  therefore  be  included  among  our  criteria 
for  an  analog  CAT  package. 

In  the  above  paragraphs  we  have  described  seven 
aspects  of  the  CAT  problem  which  must  be  consider¬ 
ed  in  judging  an  analog  CAT  algorithm.  .Although 
we  would  ideally  like  to  formulate  an  algorithm 
with  minimal  computational  requirements  a  modtAOtt 
amount  o{  oifatint  computation  is  acceptable  since 
the  off-line  computation  need  only  be  done  once 
and  is  carried  out  in  a  depot  environment  where 
good  computational  facilities  and  high  level  per¬ 
sonnel  are  available.  On  the  other  hand  since  the 
on- tint  computation  associated  with  a  CAT  algorithm 
is  replicated  for  each  TOT  and  carried  out  in  a 
field  environment  it  must  be  kept  to  a  minimum. 
Likewise  the  test  point  requirements  for  an  analog 
CAT  algorithm  must  be  kept  to  a  minimum.  Although 
the  requirement  that  the  number  of  test  points 
used  by  a  CAT  algorithm  grow  with  the  square  root 
of  circuit  complexity  Is  open  to  debate  it  is  in¬ 
dicative  of  a  fundamental  limitation  to  the  effect 
that  the  number  of  test  points  should  grow  at  less 


than  a  linear  rate  with  circuit  complexity.  Con* 
earning  the  remaining  criteria  we  want  an  algori'.tha 
that  is  robust  though  the  significance  of  this  r*-> 
quir ament  is  not  fully  understood  at  this  time. 
Similarly,  the  availability  of  circuit  models  to 
implement  an  algorithm  must  be  considered.  Finally, 
but  secondary  to  the  above  requirements,  it  would 
be  desirable  eo  have  a  module  oriented  algorithm 
which  is  amenable  to  in-situe  testing  and  parallel 
processing.  These  criteria  are  summarised  in  table. 
1,  along  with  a  set  of  goals  which  one  would  wish 
to  achieve  in  an  "ideal  analog  fault  diagnosis  al¬ 
gorithm"  . 

CAT  ALGORITHMS 

A.  Slmulation-Before-Testi  Although  it  is  essen¬ 
tially  a  brute  force  search 'algorithm,  simulation- 
before-test  is  well  suited  to  the  depot/field  com¬ 
putational  environment  of  the  CAT  problem  and,  as 
such,  it  predominates  in  most  state-of-the-art 
digital  CAT  packages.3  On  the  other  hand  its  weak¬ 
nesses  become  more  pronounced  in  the  analog  problea 
wherein  it  has  yet  to  be  successfully  implemented. 
Basically,  a  simulation-before-test  algorithm  is  a 
search  algorithm  in  which  one  simulates  the  expect¬ 
ed  test  data  which  would  result  from  various 
hypothesized  failures  in  an  off-line  environment. 
Then  when  the  actual  test  data  is  obtained  in  the 
field  it  is  compared  with  the  simulated  results  to 
determine  the  failure.  Needless  to  say  the  tech¬ 
nique  requires  immense  amounts  of  off-line  computet 
time  to  generate  the  required  data  base  but  is  ex¬ 
tremely  efficient  on-line,  wherein  one  need  only 
compare  the  test  results  with  the  simulated  data 
base. 

Unfortunately,  the  cost  of  an  analog  simulation  is 
much  greater  that  that  of  a  digital  simulation. 
Moreover,  one  requires  a  much  larger  data  base  in 
the  analog  problem  than  in  the  digital  problem  to 
cope  with  the  continuous  nature  of  the  analog  fail¬ 
ure  phenomena  and  the  robustness  problem.  .-As  such, 
there  is  considerable  doubt  about  the  applicability 
of  the  sirtulation-before-test  concept  in  an  analog 
CAT  package. 

Vis-a-vis  our  criteria  for  analog  fault  diagnosis 
simulation-before-test  requires  extremely  large 
amounts  of  off-line  computer  time  but  only  a 
minimum  of  on-line  computer  time.  Additionally, 
the  test  point  requirements  for  the  algorithm  are 
minimal.  On  the  other  hand  the  technique  has  no 
inherent  robustness  and  uses  faulted  simulation 
models  for  all  components.  With  regard  to  the 
secondary  factors  the  algorithm  is  module  oriented 
and  amenable  to  parallel  processing  but  not  in- 
situe  testing.  These  considerations  are  summarize) 
in  Table  1. 

B.  Simulation-After-Test  with  a  Single  Test  Vecta 
Rather  than  using  a  search  algorithm  for  fault 
diagnosis  one  can  attempt  to  model  the  analog  faul 
diagnosis  problem  as  a  nonlinear  equation  in  which 
one  solves  for  the  internal  variables  or  component 
parameters  in  terms  of  the  test  data.  Although 
this  may,  at  first,  seem  to  totally  bypass  the 
repetitive  simulation-before-test  algorithm,  a 
careful  analysis  will  reveal  that  each  iteration  o 
the  required  numerical  equation  solver  amounts  to 
simulation  of  the  UUT.  In  this  ease,  however,  the 
particular  simulations  which  one  carries  out  are 
based  on  known  test  data  rather  than  a-priori 
fault  hypotheses.  As  such,  the  simulations  are 
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don«  on- lint  after  the  test  data  has  been  obtained 
*fld  the  technique  is  thus  tensed  siaulat ion- after- 
tut.2 

jn  the  case  where  only  a  single  Cast  vector  is 
jeployed  Che  resultant  fault  diagnosis  equations 
tre  "almost  linear"  and  may  be  solved  with  the  aid 
oC  a  single  (off-line)  sparse  matrix  inversion.4'* 
The  test  point  requirements  for  the  algorithm, 
however,  grow  linearly  with  circuit  complexity. 
Interestingly,  this  class  of  algorithms  has  been 
discovered  independently  by  a  number  of  authors 
over  the  years,  most  of  whoa  though  that  they  had 
found  the  "ideal  algorithm"  until  they  fully 
appreciated  the  significance  of  the  test  point  re¬ 
quirement  which  severely  limits  its  epplicabilicy. 
proa  the  point  of  view  of  our  other  criteria, 
however,  the  algorithm  is,  indeed,  "ideal".  Off- 
lina  computational  requirements  are  moderate  while 
on-line  computational  requirements  srs  minimal. 
Moreover,  the  algorithm  is  inherently  robust  and 
requires  no  simulation  models  of  any  hind,  it 
tests  modules,  and  it  is  amenable  to  in-situe 
testing.  Finally,  the  computational  requirements 
associated  with  the  algorithm  are  sufficiently  low 
so  it  to  render  the  parallel  processing  question 
moot. 

C.  Simulation -After-Test  with  Multiple  Test 
Vectors:  One  approach  to  reducing  the  test  point 
requirements  of  the  simulation-after-test  algorithm 
is  to  use  multiple  test  vectors  to  increase  the 
number  of  equations  obtained  from  a  given  set  of 
test  points,  thereby  rendering  the  fault  diagnosis 
equation  Soluable  with  e  restricted  number  of  test 
points.  The  most  common  form  of  the  multiple  test 
vector  algorithm  is  the  multifrequency  algorithm 
used  in  linear  fault  diagnosis,  though  the  concept 
extends  to  the  nonlinear  case  via  the  use  of 
multiple  test  vectors  of  any  type.1*1 

The  reduced  test  point  requirement  obtained  via 
the  use  of  multiple  test  vectors  is,  however, 
achieved  at  the  cost  of  greatly  increasing  the 
complexity  of  the  resultant  fault  diagnosis  equa¬ 
tions.  Indeed,  the  "almost  linear"  equations  of 
the  single  test  vector  algorithm  are  replaced  by  . 
an  extremely  complex  set  of  nonlinear  equations 
(even  for  linear  systems)  in  the  multiple  test 
vector  algorithm.  Although  these  equations  can  be 
made  tradeable  in  the  linear  case  they  appear  to 
be  totally  untrackable  in  the  nonlinear  case  and, 
as  such,  most  of  the  advantages  of  the  simulation- 
after-test  concept  are  lost  when  multiple  test 
vectors  ere  employed. 

With  regard  to  our  criteria  the  multiple  test 
vector  algorithms  require  large  amounts  of  on-line 
computer  time  though  relatively  little  off-line 
computer  time  is  required.  In  its  most  obvious 
form  the  technique  is  robust,  though  this  robust¬ 
ness  is  compromised  by  most  of  the  "tricks”  which 
have  been  proposed  to  make  the  multiple  test  vector 
fault  diagnosis  equations  tradeable.  Faulted 
simulation  models  are  required  and  the  algorithm 
is  Inherently  parameter  oriented.  Finally,  it  is 
,  not  suited  to  either  in-situe  testing  or  parallel 
implementation. 

CONCLUSIONS 

The  above  concepts  are  summarized  in  Table  1, 
"herein  the  various  criteria,  by  which  an  analog 
CAT  algorithm  should  be  measured  are  tabulated, 
the  goals  for  an  ideal  algorithm  art  described, 


and  the  degree  to  which  the  various  algorithms 
achieve  these  goals  is  indicated.  From  the  table 
it  is  apparent  that  none  of  the  algorithms  is  fully 
acceptable.  Indeed,  even  if  one  neglects  the 
secondary  considerations  regarding  modules  vs. 
parameters,  in-situe  testing,  and  parallel  process¬ 
ing  all  three  approaches  fail  to  meet  one  or  more 
of  the  primary  criteria  (indicated  by  capital 
letters  in  the  table) .  As  such,  the  proper 
approach  to  the  solution  of  the  analog  CAT  problem 
restains  an  open  question. 
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Table  1:  Performance  of  Analog  Fault  Diagnosis  Algorithms.  Unacceptable 
performance  factors  are  Indicated  by  capital  letters. 
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Abstract 

A  simulation-after-test  algorithm  for  the  analog  fault  diagnosis  problem  is  pro¬ 
posed  in  which  a  bound  on  the  maximum  number  of  simultaneous  failures  is  used  to 
minimize  the  number  of  test  points  required.  The  resultant  algorithm  is  appli¬ 
cable  to  both  linear  and  nonlinear  systems  and  can  be  used  to  isolate  a  fault  up 
to  an  arbitrarily  specified  "replaceable  module*. 


I.  INTRODUCTION 

Conceptually,  analog  fault  diagnosis  algorithms 
can  be  subdivided  into  three  classes;3  simulation- 
before-test.  simulation-after-test  with  a  single 
test  vector,  and  simulation-after-test  with  multi¬ 
ple  test  vectors.  The  former  is  consnonly  employed 
in  digital  testing  and  is  characterized  by  minimal 
on-line  computational  requirements.  Unfortunately, 
the  high  cost  of  analog  circuit  simulation  coupled 
with  the  large  number  of  potential  fault  modes 
which  must  be  simulated  in  an  analog  circuit  limits 
the  applicability  of  simulation-before-test  al¬ 
gorithms  In  an  analog  test  environment.  As  an 
alternative  to  simulation-before-test,  a  number  of 
researchers  have  proposed  simulation-after  test  al¬ 
gorithms,  in  which  the  internal  system  variables  or 
component  parameters  are  computed  from  the  test 
data  via  a  "nonlinear  equation  solver  -  like"  al¬ 
gorithm.  Indeed,  in  the  case  where  sufficiently 
many  test  points  are  available  only  a  single  test 

vector  is  required  and  the  fault  diagnosis  problem 

o  q 

reduces  to  the  solution  of  a  linear  equation.  • 
Except  for  the  large  number  of  test  points  required 


this  approach  is  Ideally  suited  to  the  analog  fault 
diagnosis  problem  and,  as  such,  a  considerable  re¬ 
search  effort  has  been  directed  towards  the  problem 
of  reducing  its  te*t  point  requirements.3  One  such 
approach  uses  multiple  test  vectors  to  Increase  the 
number  of  equations  obtained  from  a  given  set  of 
test  points.  Unfortunately,  this  is  achieved  at 
the  cost  of  greatly  complicating  the  set  of  simul 
tanenus  equations  which  must  be  solved  and,  as  sudi, 
the  applicability  of  the  approach  is  limited. 

The  purpose  of  the  present  paper  is  to  describe  an 
alternative  simulation-after- test  algorithm  in 
which  a  bound  on  the  maximum  number  of  simultaneous 
failures  Is  used  to  reduce  the  test  point  require¬ 
ments  while  still  retaining  the  computational  sim¬ 
plicity  inherent  in  a  single  test  vector  algorithm. 
Indeed,  even  though  a  circuit  may  contain  several 
hundred  components  it  is  reasonable  to  assume  that 
at  most  two  or  three  have  failed  simultaneously. 

As  such,  rather  than  solving  a  set  of  simultaneous 
equations  in  n-space  the  solution  to  our  fault 
diagnosis  problem  actually  lies  in  a  two  or  three 
dimensional  submanifold  which  should  yield  a 
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coemensurate  reduction  in  test  point  requirements. 
Unfortunately,  even  though  we  may  assume  that  at 
most  two  or  three  components  have  failed  we  do  not 
know  which  two  or  three,  and  as  such,  some  type  of 
search  is  still  required.  Fortunately,  with  the 
aid  of  an  appropriate  decision  algorithm  the  re* 
quired  search  can  he  implemented  quite  simply. 


Consider  the  circuit  or  system  which  is  illustrat¬ 
ed  in  figure  1.  i 


®  / 


Figure  1.  Test  algorithm  for  abstract  circuit  or 
system. 


Here,  the  individual  circuit  components  or  sub¬ 
systems  are  denoted  by  circles  indexed  from  a  to  n. 
These  components  are  subdivided  into  two  groups, 
at  each  step  of  the  test  algorithm,  as  Indicated 
by  the  dashed  lines  in  figure  1.  At  each  step  we 
assume  that  one  group;  say,  d  through  n;  is  com¬ 
posed  of  good  components  and  we  use  the  known 
characteristics  of  these  components  together  with 
the  test  data  to  determine  whether  or  not  the  re¬ 
maining  components;  a.b  and  c  in  this  case;  are 
good.  Of  course,  if  components  d  through  n  are 
actually  good  then  the  resultant  test  results  for 
components  a,  b  and  c  will  be  reliable.  On  the 
other  hand,  if  any  one  of  the  components  d  through 
n  is  faulty  the  test  data  on  a,  b  and  c  will  be 
unreliable.  As  such,  we  repeat  the  process  at  the 
next  step  of  the  test  algorithm  with  a  different 
subdivision  of  components.  For  instance,  we  may 
assume  that  a  through  d  and  h  through  n  are  good 
and  use  their  characteristics  to  test  components 
e,  f  and  g.  Finally,  after  a  number  of  such 
repetitions  the  test  results  obtained  at  various 
steps  are  analyzed  to  determine  the  faulty  com¬ 
ponents. 


Of  course,  the  number  of  components  which  may  be 


tested  at  one  step  is  dependent  on  the  number  of 
test  points  available  while  the  number  of  steps 
required  is  determined  by  the  number  of  components 
which  may  be  tested  at  any  one  step  and  the  bound 
on  the  maximum  number  of  simultaneous  failures. 

As  such,  the  procedure  yields  a  natural  set  of 
tradeoffs  between  the  numbers  of  test  points,  sim¬ 
ultaneous  failures  and  steps  required  by  the  al¬ 
gorithm.  Indeed-,  since  the  computational  cost 
associated  with  each  step  of  the  algorithm  is 
essentially  the  cost  of  a  single  system  simulation 
the  latter  parameter  is  a  natural  measure  of  the 
computational  cost. 

In  the  following  section  we  describe  the  simula¬ 
tion  model  used  to  test  one  set  of  components 
under  the  assumption  that  the  remaining  components 
are  good.  In  section  three  two  decision  algorithms 
for  analyzing  the  resultant  test  data  are  describ¬ 
ed.  Indeed  the  required  theory  is  reminiscent, 
though  not  identical  to,  the  t-dlagnosibility 
theory  developed  for  digital  testing  over  the  past 

A  g 

decade.  •  Finally,  section  four  is  devoted  to  a 
number  of  examples  Including  linear  circuits  with 
12  and  22  components  which  were  run  on  a  desktop 
calculator  and  a  16  bit  mini,  respectively. 


II.  THE  SIMULATION  MODEL 

Although  our  test  algorithm  can  be  formulated  in 
terms  of  any  of  the  standard  system  models  for  the 
purpose  of  this  exposition  we  will  assume  a  com¬ 
ponent  connection  model  for  the  circuit  or  system 
2 

test.  In  the  nonlinear  case  the  unit  unden  teat 
is  represented  by  a  set  of  decoupled  state  models 
characterizing  its  components  and/or  subsystems 
together  with  an  algebraic  connection  equation  as 
follows. 


*i "  VvV 

;  Xj (0)  »  0,1*1 ,2, 

bf  ■  g^Xj.a,) 
and 


Lllb  +  L21u 


,n  (2.1) 

(2.2) 


y  «  l21b  +  l22u  (2.3) 

Here,  a  •  col(a^)  is  the  column  vector  composed  of 


the  component  input  variable s,  b  *  col(b^)  Is  the 
column  vector  composed  of  component  output  vent* 
ablu,  u  Is  the  vector  of  exttAnai  tut  inputs 
applied  to  the  system  and  y  is  the  vector  of  as¬ 
tern  usponsu  measured  at  the.  various  teat  points. 
Although  the  component  connection  model  is  not 
universal  it  is  quite  general  and  subsumes  most 
of  the  classical  topological  connection  models 
commonly  used  in  circuit  and  system  theory.* 
Moreover,  its  Inherently  decoupled  nature  is 
ideally  suited  to  the  test  problem  wherein  we  de¬ 
sire  to  distinguish  between  the  characteristics 
of  the  individual  system  components.  Although 
these  components  may  be  taken  to  be  elementary 
RLC  components  and/or  discrete  semiconductor  de¬ 
vices,  in  practice  the  “components1'  are  taken  to 
be  the  “replaceable  modules”  within  the  circuit 
or  system,  under  test;  say,  an  IC  or. a  “throw¬ 
away*  circuit  board. 

At  each  step  of  the  test  algorithm  we  subdivide 
the  "components  into  two  groups  denoted  by  “1*  and 
"2“  with  the  components  In  group  "1“  assumed  to  be 
good  and  used  together  with  the  known  values  Of  u 
and  y  to  compute  the  component  input  and  output 
variables,  a^  and  b^,  for  the  components  in  group 
"2“.  Although  computationally  we  prefer  to  work 
with  the  decoupled  component  equations  for  nota- 
tional  brevity  we  combine  the  equations  for  the 
components  in  each  group  into  a  single  equation 

i1  -  fV.a1) 


g1(x1 .a1) 


.2  ,  ,21K1  .  .  22.2  .  .  2  „ 

a  lnb  ♦  L„b  ♦  L12u 

y  -  l^b1  ♦  i.f,b2  ♦  l22u 


f2(x2,a2) 


b2  -  g2(x2,a2) 

Here,  x^a1  and  b1  are  the  vectors  of  group  ”1“ 

component  state  variables.  Inputs  and  outputs; 

2  2  2 

and  similarly  for  x  ,a  and  b  .  To  retain  nota- 
tional  compatibility  with  2.4  and  2.5  we  reorder 
and  partition  the  connection  equations  of  2.2  and 
2.3  to  be  conformable  with  2.4  and  2.5  as  follows 

a1  -  ijjb1  ♦  l]2b2  ♦  l]2u  (2.6) 


Given  equations  2.4  through  2.8  our  goal  is  to  com- 

2  ? 

pute  the  goup  “2"  component  variables,  a  and  b  , 

given  the  tut  input,  u,  the  mtasu/ud  tut 

usponsu,  y,  and  an  assumption  to  tht  e({cet  that 

tht  gnoup  *1"  components  one  not  (putty.  To  this 

end  we  assume  that  L?,  admits  a  left  Inverse, 

2  -L  “ 

fL2i ]  ,  which,  in  turn,  determines  the  allowable 

component  subdivisions.  Under  this  assumption  one 
may  then  formulate  a  component  connection  model 
for  a  "pseudo  c intuit"  composed  of  the  group  “1" 
components  with  external  input  vector  up  • 
col(u  ,  y)  and  external  output  vector  yp  » 
col (a2,  b2)  in  the  form 

x1  ■  f^x^.a1) 


9W) 


(2.10) 

(2.11) 


Since,  in  our  test  problem  u  and  y  are  known,  the 

above  equations  can  be  solved  via  any  standard 

analysis  code  to  compute  yp  ■  (a2,b2).  Now,  under 

our  assumption  that  the  group  "1“  components  are 

not  faulty  yp  ■  (a2,b2)  represents  the  inputs  and 

outputs  which  actually  appeared  at  the  terminals 

of  the  group  "2"  components  during  the  test.  As 

such,  we  may  determine  which  of  the  group  "2" 

components  are  faulty  by  solving  equation  2.5  with 
2 

Input  a  and  checking  to  determine  whether  or  not 

i 

the  resultant  output  coincides  with  b  .  Of  course, 
since  our  assumption  to  the  effect  that  the  group 
“1"  components  are  not  faulty  may  not  be  valid  the 
xuutts  o{  this  tut  ate  not  utiabte.  As  such, 
we  repeat  the  process  a  number  of  times  with 
different  choices  for  the  subdivision  of  the  com¬ 
ponents  into  group  "1"  and  group  ”2“.  Here,  the 

only  constraint  on  the  choice  of  subdivisions  is 
2  -L 

the  requirement  that  d2^]  exist  while  the  num¬ 
ber  of  combinations  employed  is  limited  only  by 
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the  cost  of  the  required  simulations.  The  results 
of  the  several  steps  in  the  test  algorithm  are 
then  analyzed  via  the  techniques  described  in  the 
following  section  to  determine  those  components 
which  are  actually  faulty. 

III.  DECISION  ALGORITHMS 

Since  the  results  of  the  test  described  in  the 
preceding  section  are  dependent  on  our  assumption 
that  the  group  "I”  components  are  not  faulty  they 
are  not  immediately  applicable.  Following  the 
philosophy  initiated  by  Preparata,  Metze,  and 
Chein®  in  their  study  of  self  testing  computer 
networks,  however,  if  one  assumes  a  bound  on  the 
maximum  number  of  faulty  components  it  is  possible 
to  determine  the  actual  fault(s)  from  an  analysis 
of  the  test  results  obtained  at  various  steps  in 
the  algorithm.  To  this  end  we  will  give  a  com¬ 
plete  analysis  of  the  theory  required  to  locate  a 
single  fault  together  with  an  heuristic  which  is 
applicable  to  the  multiple  fault  case. 

Let  us  assume  that  at  most  one  circuit  component 
Is. faulty  and  that  the  test  results  obtained  from 
a  given  step  of  the  algorithm  indicate  that  all 
group  "2*  components  are  good  as  Indicated  in  the 
following  table,  where  the  binary  anotatln" 
the  left  of  the  group  *2“  components  indicate 
those  which  were-found  to  be  good(O)  ana  oad  (1) 
at  this  step  of  the  test  algorithm 


*2> 

*r 

X 

a  b  c  ...  k 

0 

•  X 

0 

* 

y 

• 

0 

z 

In  this  case  we  claim  that  the  group  "2"  components 
are.  In  fact,  good.  Indeed,  if  a  group  two  com¬ 
ponent  were  actually  faulty  then  our  test  results 
are  incorrect,  which  could  only  happen  if  one 
of  the  group  “1"  components  was  faulty.  As  such, 
the  system  would  have  two  faulty  components  con¬ 
tradicting  our  assumption  to  the  effect  that  at 
most  one  component  is  faulty. 

Now,  consider  the  case  where  the  results  from  a 


given  step  of  the  test  algorithm  Indicate  that  ex¬ 
actly  one  group  “2"  component  is  faulty;  say,  x. 


\ 

"2* 

a  b  c  ....  k 

1 

X 

0 

• 

y 

• 

• 

0 

z 

In  this  case  the  same  argument  we  used  above  will 
guarantee  that  the  components  which  test  good;  say, 
y  through  z;  are,  in  fact,  good.  On  the  other  hard 
we  have  no  information  about  x.  It  may  be  faulty, 
or  alternatively,  the  test  result  may  be  due  to  a 
faulty  group.  "1"  component. 

Finally;-  consider  the  case  where  two  or  more  group 
“2"  components  test  bad  in  a  given  step  as  indicat¬ 
ed  in  the  following  table. 
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Since,  under  our  assumotlon  of  a  single  failure.  It 
1sr1mposs1b1e  for  two  or  more  group  *2"'  components 
to  be  faulty,  this  test  result  implies  that  at 
least  one  of  the  group  *1"  components  is  bad.  On 
the  other  hand  sl-nce  we  have  assumed  that  there  is 
at  most  one  faulty  component  the  faulty  group  "1* 
component  is  the  only  faulty  comoonent  and,  as 
such,  the  group  *2*  components  are  all  good. 

Consistent  with  the  above,  at  each  step  of  the  test 
algorithm,  either  all  or  all  but  one  group  “1" 
components  are  found  to  be  good.  As  such,  if  we 
choose  our  subdivisions  so  that  the  components 
which  are  found  to  be  good  at  one  step  of  the  al¬ 
gorithm  are  Included  in  group  “1"  In  all  succeed¬ 
ing  steps  we  eventually  will  arrove  at  a  group  "1", 
all  of  whose  components  are  known  to  be  good.  As 
such,  the  test  results  obtained  at  that  step  will 
be  reliable  thereby  allowing  us  to  accurately  de¬ 
termine  the  faulty  components  in  group  "2*. 


Unlike  the  single  fault  case,  at  the  time  of  this 
writing,  we  do  not  yet  have  an  exact  decision  al¬ 
gorithm  for  the  multiple  fault  case.  Following 
Liu,  however,  the  problem  can  be  greatly  simpli¬ 
fied  If.  one  adopts  an  "analog  heuristic"  to  the 
effect  that  two  Independent  analog  failures  will 
never  cancel.* 

Recall  from  our  discussion  of  the  single  fault 
case  that  whenever  a  test  result  Indicates  that  a 
component  is  good  then  It  Is,  In  fact,  good.  Al¬ 
though  this  Is  not  rigorously  true  In  the  multiple 
failure  case  it  1$  true  under  the  assumption  of 
our  heuristic.  For  Instance,  consider  the  test 
results  Indicated  In  the  following  table  In  which 
x  is  found  to  be  good. 


how,  If  x  Is  actually  faulty  there  must  be  a  faul¬ 
ty  group  "1"  component  whose  effect  Is  to  cancel 
the  error  tn  x  as  observed  during  this  step  of  the 
test  algorithm.  This  Is,  however,  forbidden  by 
our  heuristic  and,  as  such„  we  conclude  that  x'  is 
actually  good. 

Interestingly,  our  heuristic  can  be  carried  a  step 
further  than  Indicated  above  since,  under  our 
heuristic,  a  bad  group  "1"  component  would  normal¬ 
ly  yield  erroneous  test  results.  An  exception 
would,  however,  occur  if  some  of  the  group"!" 
components  are  totally  decoupled  from  some  of  the 
group  "2"  components.'  As  such.  If  prior  to  our 
test  we  generate  a  coupling  table  (by  simulation 
or  a  sensitivity  analysis)  which  Indicates  whether 
or  not  a  faulty  group  "1"  component  will  effect 
test  results  on  a  group  "2"  component,  our  heuris¬ 
tic  may  be  used  to  verify  that  certain  group  "1* 
components  are  good  whenever  a  good  group  “2"  com¬ 
ponent  is  located.  Consider,  for  example,  the 
following  table  In  which  a  "1"  in  the  1-J  position 
indicates  that  the  test  results  for  component  1 


\ 

"2“ 

•m  m 

"s 

a  b  c  ...  k 

0 

X 

10  1  1 

1 

• 

y 

110  0 

•  •  •  e 

• 

• 

0 

z 

•  e  •  • 

•  •  •  • 

•  011  0 

are  effected  by  component  j  while  a  *0"  In  the  1-j 
position  Indicates  that  component  j  does  not  effect 
the  test  results  for  component  1.  Now,  since  com¬ 
ponent  z  has  been  found  to  be  good  in  this  test  our 
heuristic  Implies  that  b  and  c  are  also  good.  Simi¬ 
larly,  since  component  x  is  good  so  are  a,c,  and  k. 
Thus,  with  a  single  test  we  have  verified  that  x,z, 
a,b,c.and  k  are  all  good. 

IV.  EXAMPLES 

To  obtain  examples  the  above  techniques  were 
applied  to  the  12  and  22  component  linear  amplifier 
circuits  shown  In  figure  2  using  simulated  test 
data  for  various  numbers  of  simultaneous  failures, 
choices  of  test  point  locations,  and  both  decision 
'algorithms.  All  analysis  for  the  12  component 
circuit  was  done  on  an  HP  982S  desktop  calculator 
while  the  22  component  examples  were  run  on  a  II-  - 
990/20  minicomputer.  The  results  of  sane  ISO  simu¬ 
lations  of  the  algorithms  are  tabulated  in  table  1. 
where  the  number  of  test  points,  simultaneous 
faults,  and  the  decision  algorithm  employed  are  in¬ 
dicated.  The  results  of  the  various  simulations 
are  Indicated  by  the  ambiguity  of  the  resultant 
diagnosis.  For  instance,  in  our  simulation  of  the 
12  component  circuit  with  3  test  points,  one  fail¬ 
ure  and  the  exact  algorithm  12  runs  were  made  (one 
with  each  component  fauly).  On  10  occasions  the 
fault  was  located  exactly  while  the  fault  was  lo¬ 
cated  up  to  an  ambiguity  set  composed  of  two  compon¬ 
ents  on  2  occasions.  Finally,  we  note  that  the  5th 
run  of  the  12  component  circuit  indicated  by  an  as- 
terlck  In  the  table  represents  a  simulation  In 
which  the  good  components  were  set  at  +/-2S  off  of 
nominal  to  test  the  robustness  of  the  algorithm. 
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capacitance  values  of  .OOlwf  while  the  feedback  caplcators  have  nominal  values  of  lOOpf. 
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Table  1.  Simulated  test  data.  *  indicates  a  simulated  test  in  which 
the  good  components  were  taken  to  be  */-  2  off  of  nominal. 
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A  DIFFEREMTIAL-INTERPOLATIVE  APPROACH  TO 
ANALOG  FAULT  SIMULATION 

C.-c.  Hu**,  A.  Sanqlovannl-Vencentelll*,  and  R.  Sacks** 


I.  INTRODUCTION 

After  a  half  century  of  neglect  by  the  cir¬ 
cuits  and  systems  comnunlty  the  past  decade  has 
witnessed  the  emergence  of  a  research  effort  In  the 
analog  circuit  maintenance  area.  The  various  al¬ 
gorithms  which  have  been  thus  far  proposed  for  the 
analog  fault  diagnosis  problem  may  naturally  be 
suodivlded  into  two  classes  termed  “simulatlon- 
before-test"  and  ‘simulation-after-test".  The 
former  are  coxmonly  used  in  digital  system  test  al¬ 
gorithms  and  regulre  an  automatic  test  program 
generator  (ATPG)  which  simulates  the  responses  of 
"all  possible"  failures.  This  Is  typically  done  at 
a  maintenance  depot  with  the  simulated  responses 
being  recorded  and  shipped  to  the  field  where  the 
response  of  the  unit  under  test  (UUT)  Is  compared 
with  the  simulated  responses  to  determine  the 
failure.  The  major  advantage  of  slmulatlon-before- 
test  Is  that  it  is  Ideally  matched  to  the  depot/ 
field  maintenance  environment  with  the  largest  part 
of  the  computation  done  only  once.  As  such,  the 
technigue  Is  Ideally  suited  for  digital  testing 
where  the  binary  nature  of  the  problem  keeps  the 
number  of  failures  to  be. simulated  within  bounds 
and  eliminates  tolerance  problems.  Unfortunately, 
In  the  analog  problem  we  must  cope  with  a  continuum 
of  possible  failures  and  simultaneously  deal  with 
good  components  which  are  In  tolerance  but  not  nom¬ 
inal.  As  such,  a  tremendous  number  of  simulations 
are  required  by  a  simulation-before-test  algorithm, 
while  some  type  of  decision  algorithm  Is  required 
to  cope  with  the  tolerance  effects. 

Unlike,  simulation-before-test,  simulation-after 
test  uses  an  "equation  solver-like"  algorithm  to 
compute  the  parameters  of  the  UUT  components  in  the 
field.  Since  most  such  algorithms  require  Iterative 
evaluation  of  the  equation  to  be  solved,  the  UUT  is 
effectively  simulated  at  each  Iteration,  though  the 
simulation  is  based  on  actual  test  data  rather  than 
hypothesized  failure  data.  The  simulation  process 
is,  thus,  carried  out  after  testlnq  the  UUT  and 
hence  the  choice  of  terminology.  The  advantage  to 
such  an  approach  is  that  the  faulty  component  para¬ 
meters  are  computed  explicitly,  thereby,  eliminat¬ 
ing  the  ambiqulty  caused  by  the  use  of  discrete 
simulation-before-test  data  and  tolerance  effects. 
Althouqh  relatively  few  simulations  are  required 
for  each  UUT’,  they  must  be  carried  out  in  the  field 
rather  than  the  depot  and  they  must  be  repeated  for 
each  UUT. 

The  purpose  of  the  present  paper  Is  to 


1 

describe  a  research  effort  directed  at  alleviating 
some  of  the  difficulties  In  developing  a  simulattm- 
before-test  algorithm  for  analoq  fault  diagnosis. 
The  underlying  philosophy  and  motivation  for  our 
formulation  is  discussed  in  section  2,  alonq  with  a 
derivation  of  the  required  dlfferential-interpola- 
tlve  fault  diagnosis  formula.  Finally,  section  3 
is  devoted  to  a  number  of  Illustrative  examples  of 
the  approach.  These  Include  both  linear  and  non¬ 
linear  examples  formulated  In  the  frequency  and 
time  domains,  respectively. 

II.  A  DIFFERENTIAL- INTERPOLATIVE  ALGORITHM 

Although  any  practical  fault  diagnosis  algo¬ 
rithm  must  be  able  to  handle  systems  with  a  hundred 
or  more  components,  from  an  intuitive  point  of  view 
our  algorithm  is  best  illustrated  in  the  two  com¬ 
ponent  cases  where  the  parameter  space  can  be  dis¬ 
played  graphically.  Say,  we  are  dealing  with  an  RC 
circuit  for  which  the  parameter  space  Is  illustrat¬ 
ed  In  figure  1. 


Figure  1:  Parameter  space  for  RC  circuit. 


Here,  R  and  C  represent  normalized  parameter  values, 
wherein,  the” nominal  parameter  values  are  transibrm- 
ed  to  the  origin.  In  the  most  general  slmulation- 
before-test  algorithms  one  assumes  that  the  faulty 
parameter  values  may  lie  anywhere  in  the  R-C  plane 
and  therefore  carries  out  simulations  along'a  two 
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dimensional  discrete  array  spread  over  the  entire 
plane. 

Fortunately,  In  a  “real  world*  testing  environ¬ 
ment  one  can  assume  that  only  a  “limited  number  of 
components*  fall  simultaneously.  In  our  two  com¬ 
ponent  example  we  may  therefore  assume  that  either 
R  or  C  has  failed  with  the  other  remaining  nominal. 
In  which  case  the  circuit  need  only  be  simulated 
at  a  discrete  set  of  points  along  the  coordinate 
axes  In  the  R-C  plane  denoted  by  x's  in  figure  1. 

As  such,  the~mm>ber  of  simulations  regulred  Is  sig¬ 
nificantly  decreased.  Indeed,  this  Is  one  of  the 
major  advantages  of  the  slmulatlon-before-test 
concept  as  compared  to  slmulatlon-after-test  algo¬ 
rithms  which  typically  fall  to  exploit  a  “limited 
rumber  of  failures*  assumption. 

While  the  above  described  approach  has  been 
used  with  considerable  success  In  digital  system 
testing,  wherein,  the  axes  are  binary  and  no  toler¬ 
ance  problems  are  encountered,  it  Is  not  well  suit¬ 
ed  to  the  analog  test  problem.  First,  an  analog 
failure  may  occur  anywhere  along  the  axis  and  hence 
some  type  of  approximation  scheme  Is  regulred  to 
Interpolate  between  the  discrete  simulations. 
Secondly,  a  “good*  component  Is  assumed  to  be  In¬ 
tolerance,  though  It  may  not  be  nominal.  As  such, 
in  an  analog  environment  the  “limited  number  of 
failures*  assumption  Implies  that  the  solution  to 
our  fault  diagnosis  problem  lies  near,  but  not 
necessarily  on.  the  coordinate  axes  as  Indicated  by 
the  shaded  regions  In  figure  2a. 


assumption. 

b)  Illustration  of  the  dlfferentlal- 
interpolatlve  diagnosis  algorithm. 


While  we  might  choose  to  simply  fill  the  shaded 
region  with  additional  simulations,  the  cost  of 
such  a  process  may  prove  to  be  excessive.  Rather, 
we  exploit  the  fact  that  the  deviations  of  qood 
component  parameters  from  nominal  are  small  and  use 
a  1st  order  Taylor  series  approximation  to  approx¬ 
imate  the  deviation.  We  note  that  such  an  aoproach 


cannot  be  used  to  locate  the  faulty  parameter  values 
which  may  be  far  from  nominal;  Indeed,  It  is  often 
infinite  or  zero;  though  it  can  be  used  to  cope 
with  the  tolerance  effects. 

Our  dlfferentlal-interpolative  approach  thus 
uses  a  classical  minimum  distance  algorithm  to  lo¬ 
cate  the  qeneral  region  of  the  faulty  parameter 
values  Indicated  by  the  circle  in  figure  2a  (which 
1$  magnified  In  figure  2b).  Now,  It  Is  assumed 
that  the  simulated  values  of  the  system  responses; 
fj,  f2,  and  fji  corresponding  to  the  points;  c^.Cg. 

and  c-;  are  available  along  with  the  associated  1n- 

J  -1  -1  -1 

verse  sensitivity  matrices;  0,  ,  J2  .and  J3  .  We 

then  Interpolate  these  data  points  to  approximate 
the  system  responses  and  the  associated  Inverse 
sensitivity  matrlce  along  the  axis  by  functions 

f(c)  and  J(c)"\  Although  any  Interpolation  can  be 
employed  we  have  had  our  best  results  using  a  bi¬ 
linear  interpolation  for  f  (which  gives  exact 
results  In  the  linear  case)  and  a  second  order 

polynomlc  interpolation  of  J”^.  Now,  If  x  denotes 
the  faulty  parameter  vector  and  m  denotes  the 
measured  system  responses  then  a  1st  order  Taylor 
series  approximation  combined  with  our  Interpola¬ 
tion  will  yield  the  (approximate)eguality 

m  ■  f (c)  ♦  0(c)[x  -  c]  1. 

for  those  values  of  c  near  x.  Equivalently, 

[x  -  c]  ■  J(e)’'[w  -  f(c»  2. 

Interestingly,  by  Invoking  the  Projection 
theorem  one  can  reduce  the  above  vector  eguatlon  to 
a  scalar  eguatlon  and  simultaneously  eliminate  the 
requirement  for  storing  the  Inverse  sensitivity 
matrices.  Indeed,  the  vector  tx-c)  will  be  perpen¬ 
dicular  to  the  axis  at  the  point  c  which  makes  the 
closest  approach  to  the  fault.  As  such.  If  ec  de¬ 
notes  the  unit  vector  in  the  direction  of  the  axis 
then 

0  -  e*[x  -  c)  •  e*J(cr1[m  -  f(c))  3. 

which  can  be  solved  for  the  faulty  parameter  value, 
c.  Note,  our  goal  Is  to  solve  for  t.  not  x,  since 
we  are  interested  In  locating  the  faulty  parameter 
value  in  the  presence  of  the  tolerance  problem,  but 
we  really  do  not  care  to  compute  the  deviations 
from  omlnal  In  the  good  parameters. 

To  suemarlze,  if  rather  than  simply  storing 
the  simulated  circuit  responses,  f, .  we  also  store 
t  -1  1 

the  vectors  e‘Jj  then  the  tolerance  effects 

associated  with  the  good  components  can  be  com¬ 
pletely  removed  from  our  fault  diagnosis  algorltnm- 
at  least  up  to  the  approximation  error  Induced  by 
the  Interpolation  process  and  Taylor  series  ex¬ 
pansion.  Since  most  good  circuit  simulation  cooes 
Include  a  packaqe  for  generating  sensitivity  ma¬ 
trices  at  little  additional  cost  over  and  above 
that  Involved  In  simply  simulating  the  circuit 
responses,  the  approach  can  be  implemented  with 
only  a  minimal  Increase  in  simulation  costs.  At 
such,  the  major  expense  associated  with  the 
aporoach  lies  with  the  storage  requirements  (for 
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the  end  vectors)  which  are  approximately 

double  that  of  a  classical  fault  simulation  algo¬ 
rithm. 

Although  the  above  derivation  has  been  illus¬ 
trated  in  the  two  dimensional  case  with  a  single 
faulty  parameter  it  can  be  readily  extended  to  a 
general  setting,  say  with  several  hundred  comporente 
and  three  or  four  simultaneous  faults.  If  one 
assumes  p  simultaneous  faults  then  p  inner  products 
are  required  to  apply  the  Projection  theorem  yield¬ 
ing  p  equations  and  p  unknowns  to  be  solved  for  the 
faulty  parameter  values.  Otherwise  the  formulation 
for  the  general  case  is  identical  to  the  slnqle 
fault  case  described  above. 

III.  EXAMPLES 

In  this  section,  two  examples  are  given,  one 
of  them  for  linear  systems  and  one  for  the  nonlinear 
case.  All  of  these  examples  were  simulated  on  an 
HP9825A  programmable  calculator,  and  yielded  fairly 
good  results. 

Our  first  example  is  a  second  order  low  pass 
filter.  The  filter  contains  five  components,  K, 

R-| .  Rj,  C^,  and  C2,  while,  the  circuit  diagram  is 

shown  in  figure  3-1 


Figure  3-1. 

The  transfer  function  for  this  circuit  is 
qlven  by 

f(r.s)  •  3 - - - (3-1) 

S<CtC2R1R2  ♦  s[R2C2  ♦  R,C2  ♦ 


The  partial  derivatives  of  the  transfer  func¬ 
tion  with  respect  to  each  parameter  take  the  form 
0  ♦  SC^k 

-j 


|J(r,s) 


(3-2) 
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(3-4) 

(3-5) 


-Kirc^Rj  ♦  SR?  ♦  SR1 7 


(3-6) 


where  0  ■  S^C^Rj  ♦  SfR^  ♦  R^  ♦  R^O-K))*  1 


Since  we  have  five  parameters  in  the  transfer 
function,  five  distinct  test  frequencies  are  re¬ 
quired  to  provide  sufficient  information  for 
diagnosis. 

The  fault  diagnosis  results  are  listed  in 
table  3-1.  Here,  the  nominal  values  of  K.R^.Rj.Cj, 

C2  are  1.6,  IKfi,  lto,  0.16  F  and  0.16uF 
respectively  and  the  faulty  parameter  1$  underlined 
in  the  table 


Table  3-1 
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In  the  first  simulation,  K  Is  the  faulty  com¬ 
ponent  with  a  value  of  <X6,  while  the  other  four  com¬ 
ponents  are  5*  or  so  off  their  nominal  values,  the 
simulation  result  shows  that  K  failed,  and  locates 
it  at  K-0.591.  The  same  remarks  apply  to  the  other 
five  simulations. 


Although  the  technique  generally  yields  satis¬ 
factory  results  occasional  errors  occur  when  the 
good  components  are  too  far  out  of  tolerance.  For 
instance,  the  following  parameter  values  K*1.62,  R, 
■1070,  R2*910,  C^*0.5  ,  and  C2«0.172w  led  to  an 

erroneous  result.  The  simulation  shows  that  C2  has 

failed  with  the  value  of  0.179u.  However,  the 
faulty  component.  In  this  simulation,  is  actually 
Ci.  If  we  sketch  a  two  dimensional  representation 
of  the  Ci,  C»  plane  the  difficulty  becomes  clear. 
Figure  3-2  snows  that  C?  is  too  far  away  from  its 
own  nominal  value,  an*  thus  instead  of  locatinq  the 
error  atoas  we  expect  the  sinulation  resut  locates 
the  failure  at  6,  with  the  differential  term  still 
pointing  toward  the  actual  failure  denoted  by  x. 
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Figure  3-2 
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Our  nonlinear  example  is  composed  of  a 
diode  loaded  by  a  shunt  RC  circuit  as  illustrated 
in  Figure  3-3.  The  diode  Is  modeled  by  the 


characteristic  function^  ,y 

1  •  He  0  *  - 


1) 


(3-7) 
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Figure  3-3 


Now,  instead  of  working  with  frequency  domain 
transfer  functions,  we  work  In  the  time  domain.  A 
state  equation  for  this  circuit  Is  given  by 


1  ,  <vrvc>/vt 


(3-a) 


The  goal  is  to  Inteqrate  this  differential 
equation  so  as  to  build  a  V  (?)  vector  and  f(r) 
vectors  as  In  the  previous  examples . 


Numerical  techniques  can  be  used  to  compute 
Vc(t)  at  any  Instant  t.  In  this  example,  the 

Vc(r)  vector  was  elevated  by  applylnq  the  fourth 
Runqe-Kutta  method.  Note  that  since  there  are  four 
Independent  parameters.  R,  C,  I0.  and  Vj;  In 
equation  (3-8)  Vv(r,  t)  should  be  evaluated  at  four 
different  time  Instants  to  build  a  Vc(f)  vector. 

The  simulation  results  are  surnnerized  in  table 
3-2.  The  nominal  values  of  I0,  Vy,  R,  C  are  0.2, 
0.1,  IK.  and  0.25  respectively. 
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IV.  CONCLUSIONS 


For  the  slmulatlon-before-test-approach  to 
fault  diagnosis,  we  naln  from  the  fact  that  most 
computation  can  be  done  by  off-line  computation, 
thus  greatly  reducing  the  repetitive  on-line  com¬ 
putation  associated  with  many  fault  diagnosis  algo¬ 
rithms.  From  a  practical  point  of  view,  the  eco¬ 
nomics  of  such  an  approach  are  extremely  attractive. 
Unfortunately,  the  slmulatlon-before-test  approach 
Is  subject  to  a  certain  degree  of  ambiqulty  Intro¬ 
duced  by  good  components  which  are  in-tolerance 
but  not  nominal. 


In  this  paper,  we  have  proposed  a  slmulatlon- 
before-test  algorithm  for  analog  fault  diagnosis. 

In  which  a  dlfferential-interpolatlve  technique  Is 
used  to  eliminate  the  ambiguity  caused  by  tolerance 
effects.  Our  approach  has  been  tested  with  satis¬ 
factory  results  In  both  the  linear  and  nonlinear 
cases.  In  fact,  for  the  linear  case,  the  approach 
provides  an  exact  Interpolation  for  f(c)  on  the 
axes,  and  thus  reduces  the  amount  of  slmulatlon- 
before-test  data  required  on  each  axis.  Although 
this  Is  not  true  for  nonlinear  case,  the  diagnosis 
results  are  still  very  attractive.  Of  course, 
occasional  errors  may  occur  when  the  good  compon¬ 
ents  are  too  far  out  of  tolerance.  This  phenomena 
is,  however,  expected  and  well  understood.  Indeed, 
the  difficulty  occurs  only  when  the  1st  order 
Taylor  series  approximation  Is  too  good.  Because 
this  phenomena  will  rarely  occur  in  the  real  world, 
we  believe  that  It  may  be  neglected  In  a 
practical  algorithm. 
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Abstract  of 


A  DATA  BASE  FOR  SYMBOLIC  NETWORK  ANALYSIS 


C.-c  Wu  AND  R.  Saeks 


Abstract 


Historically,  symbolic  network  analysis  has  been  motivated  by  the  prob¬ 
lems  of  circuit  design  and,  as  such,  the  emphasis  has  been  pilace<f  on 
quickly  and  efficiently  obtaining  a  symbolic  transfer  function  from  a 
given  set  of  circuit  specifications.  In  an  operational  or  maintenance 
environment,  however,  one  is  typically  given  a  prescribed  nominal  circuit 
and  desires  determine  the  effect  of  various  (possibly  large)  perturbations 
thereon.  This  is  the  case  in  a  power  system  where  one  is  given  a  fixed 
network  and  desires  to  determine  the  effect  of  proposed  modifications  thereto. 
Alternatively,  in  the  problem  of  analog  circuit  fault  diagnosis  one  desires 
to  simulate  the  effect  of  a  number  of  alternative  failures  to  compare  the 
simulated  data  with  the  observed  failure  data. 

In  such  an  operational  or  maintenance  environment  numerous  perturbations 
of  the  nominal  circuit  are  studied  and,  as  such,  significant  computational 
efficiencies  can  be  obtained  if  one  first  generates  a  data  base  !n  terms 
of  the  nominal  circuit  parameters  and  then  extracts  the  appropriate  symbolic 
transfer  function  from  the  data  base  each  time  a  different  symbolic  transfer 
is  required.  Of  course  the  benefit  to  be  achieved  via  such  an  approach  is 
dependent  on  the  size  of  the  data  base  and  the  ease  with  which  a  symbolic 
transfer  function  may  be  retrieved  therefrom. 

The  obvious  manner  in  which  to  generate  such  a  data  base  is  to  simply 
pre-compute  the  coefficients  of  all  required  symbolic  transfer  functions 
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and  store  them  in  the  data  base.  Retreival  from  such  a  data  base  is,  of 
course,  immediate  but  the  data  base  may  become  overly  large.  Indeed,  the 
number  of  transfer  functions  which  must  be  stored  is  Q(k^)  where  k_  is  the 
total  number  of  potentially  variable  circuit  parameters  and  £  is  the  maximum 
number  of  circuit  parameters  which  may  vary  simultaneously.  An  alternative 
approach  is  to  store  the  nominal  transfer  function  information  and  then  use 
Householder's  formula  to  compute  the  required  symbolic  transfer  functions. 
In  such  a  data  base  we  need  only  store  0(£  )  transfer  functions  where  £  is 
the  total  number  of  component  output  terminals  but  retreival  requires 
0(n^+p^)  multiplications  where  p  is  the  actual  number  of  circuit  parameters 
which  vary  simultaneously.  Since,  in  practice,  n_  >>  p  the  retreival  process 
requires  approximately  O(n^)  multiplications  and  is  dominated  by  the  large 
dimensional  matrix  multiplication  required  by  Householder's  formula  rather 
than  the  low  dimensional  inverse. 

In  the  present  paper  we  will  formulate  an  alternative  data  base  for 

2 

the  symbolic  transfer  functions  which  also  requires  0(£  }  entries,  but  for 
which  retreival  requires  only  Q(p^)  multiplications.  Since  p  is  typically 
small  this  is  tantamount  to  immediate  retreival. 
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Abstract  of 


DIAGNOSABILITY  OF  NONLINEAR  CIRCUITS  AND  SYSTEI1S 


Richard  Saeks, 

Alberto  Sangiovanni-Vincentelli 

AND  V,  VlSVANATHAN 


Abstract 


A  theory  for  the  diagnosability  of  nonlinear  dynamical  systems  is  develop¬ 
ed.  It  is  based  on  an  input-output  model  of  the  system  in  a  Hilbert  space 
setting.  A  necessary  and  sufficient  condition  for  the  local  diagnosability 
of  the  system,  which  is  a  rank  test  on  a  matrix,  is  derived.  A  simple 
sufficient  condition  is  also  derived.  It  is  shown  that,  for  locally  diag- 
nosable  systems,  there  exist  a  finite  number  of  test  inputs  that  are  suffic¬ 
ient  to  diagnose  the  system.  Illustrative  examples  are  presented. 
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ANALOG  FAULT  DIAGNOSIS  WITH  FAILURE  BOUNDS 


C.-C,  Wll,  K,  N  A  KAJIMA, 

C .  -L .  '.'ey  and  R.  Saeks 
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Abstract 


A  simulation-after-test  algorithm  for  the  analog  fault  diagnosis 
problem  is  proposed  in  which  a  bound  on  the  maximum  number  of  simul¬ 
taneous  failures  is  used  to  minimize  the  number  of  test  points  required. 
The  resultant  algorithm  is  applicable  to  both  linear  and  nonlinear 
systems  with  multiple  hard  or  soft  faults  and  can  be  used  to  isolate 
failure  up  to  an  arbitrarily  specified  "replaceable  chip  or  subsystem". 
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MULTIDIMENSIONAL  SYSTEM  THEORY 
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1.  Title  of  Investigation:  Multidimensional  System  Theory 

2.  Senior  Investigator:  J.  Murray  Telephone:  (806)  742-3506 

3.  JSEP  Funds:  Current  $25,875 

4.  Other  Funds:  Current 

5.  Total  Number  of  Professionals:  PI 1 s  I  (I  mo.)  RA‘s  1  (1/2  time 

6.  Summary: 

Although  most  research  in  the  image  processing  area  is  motivated  by  the 
computational  problems  associated  with  the  actual  image  processing  algorithms, 
progress  in  the  area  has  also  been  limited  by  the  cost  of  designing  an 
efficient  2-D  signal  processing  algorithms.  Indeed,  in  many  image  processing 
applications  simple  non-recursive  algorithms  are  used  in  lieu  of  far  superior 
recursive  algorithms  because  of  the  prohibitive  design  costs  associated  with 
the  recursive  algorithms.  As  such,  this  work  unit  is  directed  at  the  problem 
of  developing  efficient  design  techniques  for  stable  2-0  digital  siqnal  pro¬ 
cessors.  In  this  endeavor  we  have  developed,  and  reported  upon,  a  2-dimensional 
design  algorithm  based  on  a  spatially-invariant  symmetric  half-plane  recursive 
model  and  are  in  the  process  of  developing  an  algorithm  which  uses  a  2-D  fre¬ 
quency  domain  model  for  a  periodically  varying  system  originally  introduced 
by  Jury  and  Mullin.  The  latter  model  completely  eliminates  the  analytical 
difficulties  classically  associated  with  2-dimensional  design  but  this  is 
achieved  at  the  price  of  working  with  high  dimensional  matrices.  As  such, 
the  computational  cost  of  the  design  process  based  thereon  is  prohibitive. 

We,  however,  believe  that  the  structure  of  the  matrices  which  arise  in  this 
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model  can  be  exploited  to  formulate  a  class  of  efficient  design  algorithms 
and  are  presently  investigating  this  possibility. 
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DIGITAL  FILTERING"  * 

John  Murray 

Department  of  Electrical  Engineering 
Texas  Tech  University 
Lubbock,  TX  79«09  USA 


Abstract 

A  new  approach  to  two-dimensional  digital  filtering  is  presented. 

This  approach  is  based  on  a  one-dimensional  periodically  time-vary¬ 
ing  model  which  accurately  reflects  the  scanning  process  Inherent  in 
most  recursive  multidimensional  signal  processing.  Time-varying 
models  are  in  general  intractable;  however,  periodically  time-vary¬ 
ing  discrete-time  models  such  as  occur  in  the  present  case  are  essen¬ 
tially  equivalent  to  multi-input,  multi-output,  one-dimensional  time- 
invariant  systems.  They  therefore  permit  the  application  or  classi¬ 
cal  techniques  to  design  and  analysis  problems.  Two  further  advan¬ 
tages  of  the  approach  are  the  fact  that  it  bypasses  the  problem  of 
boundary  conditions,  and  that  allowing  time-variation  gives  a  degree 
of  design  flexibility  not  available  in  the  shift-invariant  case.  Some 
possible  design  methods  using  these  time-varying  ideas  are  presented. 


1.  INTRODUCTION 

The  general  field  of  two-dimensional  data 
processing  has  been  the  subject  of  exten¬ 
sive  investigation  during  the  past  several 
years.  The  simplest  situation,  and  that 
which  has  received  most  attention  in 
theoretical  work,  is  the  shift-invariant 
case,  where  the  processing  operations  are 
assumed  to  commute  with  translations  in 
both  directions.  Much  of  the  effort  in 
this  direction  has  been  devoted  to  the 
analysis  and  design  of  two-dimensional 
digital  filters.  In  studying  this  work, 
one  quickly  becomes  aware  of  a  basic  di¬ 
chotomy  between  finite  impulse  response 


well  understood  and,  although  there  are 
problems  in  some  areas  (such  as  uniform 
approximation)  and  difficulties  with  the 
data-rates  and  amount  of  computation  re¬ 
quired  (which  are  inevitable  in  two-dimen¬ 
sional  work)  the  use  of  FIR  filters  is 
almost  routine  by  now  in  two-dimensional 
data  processing.  The  design  and  use  of 
HR  filters,  on  the  other  hand,  very  often 
presents  serious  difficulties.  In  what 
follows,  we  will  point  out  some  theoretical 
problems  associated  with  shift -invariant 
IIR  filtering,  and  show  how  the  theory  of 
time-varying  systems  offers  a  (theoretical) 
resolution  of  these  problems. 


(FIR)  filters,  which  are  usually  imple¬ 
mented  nonrecursively ,  and  Infinite  im¬ 
pulse  response  ( IIR) 'filters ,  which  are 
usually  implemented  recursively.  The 


2.  IIR  FILTERING  AND  CAUSALITY 

2.1.  Two-Dimensional  Ccusality 
In  one-dimensional  data  processing,  the' 
dimension  in  question  is  usually  time,  and 


theory  and  design  of  FIR  filtering  are 

This  research  suooorted  in  part  by  the  Joint  Services  Electronic  Program  of  Texas 
Tech  University  under  CNR  Contract  75-C-1136. 
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so  there  Is  a  natural  concept  of  causality. 
In  -two  dimensions,  the  situation  is  more 
compdlcated .  In  many  cases,  the  dimensions 
are  both  spatial,  and  so  there  is  no  in¬ 
trinsic  causality.  Even  in  the  case  where 
one  dimension  is  time  and  the  other  is 
space  (e.g.,  transducer  arrays)  one  will 
often  have  the  data  already  recorded,  and 
so  causality  will  not  oe  significant.  How¬ 
ever,  in  this  latter  case,  the  appropriate 
causality  is  a  two-dimensional  "symmetric 
half-plane"  causality  -  but  it  should  be 
enpr.aslzed  that  this  causality  is  not  in¬ 
trinsic  to  the  data;  it  is,  rather,  a  pro¬ 
perty  of  the  processing  used.  At  this 
point,  a  distinction  must  be  made  between 
FIR  and  IIS  filters. 

2.2.  One-Dimensional  Causality. 

As  was  mentioned  in  the  previous  paragraph, 
there  may  or  may  not  be  a  notion  of  causal¬ 
ity  inherent  in  two-dimensional  data.  How¬ 
ever  the  data  must  be  processed  .in  time, 
and  this  introduces  a  concept  of  causality 
which  (at  least  with  classical  processing) 
is  one-dimensional.  How  compatible  this 
is  with  two-dimensional  causality  in  the 
data  will  depend  on  the  specific  problem. 

In  the  context  of  nonrecursive  processing, 
the  order  in  which  the  data  are  processed 
i3  irrelevant,  and  so  no  real  conflict  can 
arise;  but  the  very  nature  of  recursive 
processing,  in  which  the  output  at  a  point 
depends  on  the  output  at  previous  points, 
demands  that  an  order  be  specified.  This 
order  is  almost  always  taken  to  be  a 
"scanning"  order,  in  which  a  line  is  pro¬ 
cessed  from  left  to  right,  followed  by  the 
next  line  below  it,  etc..  We  will  assume 
that  this  is  the  order  in  which  processing 
occurs  for  the  remainder  of  this  paper. 

There  is,  however,  an  inconsistency  between 
Shis  ordering  and  the  assumption  of  shift- 
invariance.  While  not  serious  in  practice, 
it  does  indicate  that  the  theory  may  run 
into  difficulties.  This  inconsistency 


arises  from  the  fact  that  the  above  order¬ 
ing  requires  that  the  horizontal  lines  be 
finite,  while  the  use  of  shift-invariance 
requires  in  principle  that  they  be  infinite 
in  at  least  one  direction.  A  first  approx¬ 
imation  to  resolving  this  difficulty  is 
given  by  the  McClellan  transform^  which 
essentially  concatenates  the  lines  into 
one  long  line,  and  applied  one-dimensional 
linear  shift-invariant  theory  to  the  re¬ 
sulting  signal;  this  however  has  the  dis¬ 
advantage  in  principle  of  treating  the 
edges  of  the  image  in  the  same  way  as 
points  in  the  interior.  (In  practice,  of 
course,  one  applies  suitable  boundary  con¬ 
ditions).  It  is  clear  that  the  model 
which  most  accurately  reflects  the  actual 
processing  being  done  in  this  situation  is 
a  one-dimensional  periodically  time-vary¬ 
ing  model. 

3.  PERIODICALLY  VARYING 
DISCRETE-TIME  SYSTEMS 

Time-varying  systems  in  general  are  ex¬ 
tremely  difficult  to  work  with;  It  is 
therefore  fortunate  that  one  of  the  few 
classes  of  time-varying  systems  for  which 
a  complete  closed-form  theory  exists  is 
the  class  of  periodically  varying  discrete- 
time  systems  .  ^  ^  *  **  ^  There  are  several 
versions  of  this  theory,  but  they  all  des¬ 
cribe  scalar-input  scalar-output  systems 
whose  period  of  variation  is  n  sampling 
periods  by  nxn  time-invariant  transfer 
matrices.  We  proceed  to  outline  one 
version. 


Input  and  output  sequences  are  represented 
by  the  transform: 


where 
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so  that  the  transform  of  a  scalar  signal 
Is  an  n-dlmenslonal  column  vector.  The 
input-output  relationship  of  a  scalar 
system  of  the  type  under  consideration  is 
then  given  by 

Y(Z )  =  P(Z)U(Z) 

where  P(Z)  is  an  nxn  matrix  of  rational 
functions  of  the  form 

PiJ(Z)  =  z\j(zn) 

where 

i-j  i  *  J 

n+i-j  i  <  j 


immense  difficulty;  namely,  n  must  be 
taken  to  be  the  number  of  points  in  a  line 
of  data,  and  so  is  usually  very  large. 

This  implies  that  nxn  matrices  of  rational 
functions  will  be  totally  intractable.  Be¬ 
fore  discussing  this  problem,  however,  we 
will  outline  how  the  two-dimensional  situ¬ 
ation  relates  to  the  periodically  time- 
varying  approach.  From  now  on,  n  will  de¬ 
note  the  number  of  points  in  a  horizontal 
line . 

As  an  example,  we  consider  a  mask  of  the 
form 


For  example,  a  memoryless  time-varying 
gain  a(m)  is  given  by  the  diagonal  matrix 

a(o) 

a(l)  0 

0  a  ( n )  _ 

while  the  unit  delay  is  given  by  the 
matrix 


all  a10  al,-l 
a01  a00 

where  the  (0,0)  subscript  indicates  the 
point  at  which  processing  is  occurring. 
Specifically,  if  the  above  were  the  input 
mask  of  a  purely  nonrecursive  filter,  the 
output  would  be  the  convolution  of  the 
above  array  with  the  input  array. 


0  0 . z 

Z  0. 

0  Z  •  . 

c  6. . ! : .z  o  . 

_ 

(It  follows  that  in  this  representation, 
time-invariant  systems  are  described  by 
circulant  matrices). 

It  can  be  shown  that  by  using  the  above 
transformation,  systems  which  vary  with 
a  period  ec.uai  to  r.  sampling  periods  can 
be  treated  precisely  as  if  they  wer'-  r- 
ir.tut  -output  time-invariant  systems; 
in  particular,  such  a  system  is  stable  if 
and  only  if  the  matrix  ?(?.)  is  invertible 
for  all  2  with  |Z|  r  1. 

u  APPLICATION  TO  TWO-DIMENSIONAL 
PROCESSING 

In  striving  the  thecry  outlined  in  the 
:  re  vious  ser'ion  *  o  *  wo-dl  rnen.si  o.nal  data 
:  r'f^dsir.g,  we  immediately  encounter  an 


The  two-dimensional  Z-transform  of  the 
above  array  is  given  by 

p(Z1(Z2)  =  aoc+a01Z.  (an  Va10+1l,-iZ21)2r 

If  one  uses  the  McClellan  transform,  that 
is,  concatenates  the  rows  and  regards  the 
filter  as  a  one-dimensional  shift -invariant 
filter,  the  resulting  one-dimensional  Z- 
transform  is 

p(Z)  *  a00+a01Z +  ^ al 1 Z+ai0+al , -1 2  ^2 


To  put  this  in  clearer  perspective,  we 
will  write  it  in  terms  of  our  time-varying 
model:  the  resulting  transfer  operator  is 

a00+a102  ,ai,-lzn 

aoi2+an2n+!  ■  ° 


P(Z)- 


-lz' 
*■ »  1 


n-1 


a..+a, .2° 

00  10 


<2) 


-a  circulant  matrix,  as  expected. 


203 


Ip  practice,  of  course  one  does  not  imple¬ 
ment  this  as  a  strictly  shift-invariant 
one-dimensional  filter;  one  puts  in  appro¬ 
priate  boundary  conditions  when  a  boundary 
is  crossed.  It  is  easy  to  see  that,  if 
zero  boundary  conditions  are  assumed,  the 
transfer  operator  which  describes  the 
actual  processing  performed  is 


?(2) 


^Q^IO2 

a01Z+aUZ' 


,n+ 1 


al,-Iz 


..  0,  0 


(3) 


-a  Toeplitz,  rather  than  a  clrculant,  ma¬ 
trix,  since  the  processing  is  now  (slight¬ 
ly)  time -varying. 

While  this  may  seem  like  a  trivial  modifi¬ 
cation,  it  can  in  some  situations  have 
serious  theoretical  effects.  For  instance 
if  one  specializes  to  a  quarter-plane 
filter  by  setting  a^  =  Q,  the  time- 
invariant  stability  condition  is  (from 
(1)> 

a00*a01Z+ai0Zn+auZn+1t0  for  |Z|  s  1.  (4) 

However,  if  one  uses  the  time-varying 
version  (3)  in  this  situation,  the  result¬ 
ing  matrix  is  a  lower  triangular  Toeplitz 
matrix  whose  diagonal  element  is 


aoo+aio2 


and  so  the  stability  condition  in  this 
case  is 

a00a1QZn*  0  for  |Z|  «  1  (5) 

•which  is  substantially  different  from  (4). 

(The  problem  here  i3  that  if  (4)  is  not 
satisfied  and  ;5>  is,  the  matrix  (3)  will 
be  invertible  for  all  | Z !  (  1,  but  the  in¬ 
verse  matrix  may  contain  extermely  large 
elements,  and  so  while  the  system  is 
stable' in  principle,  it  will  be  unstable 
in  practice . ) 


5.  POSSIBLE  DESIGN  APPROACHES 
As  was  pointed  out  previously,  a  major 
problem  blocking  any  realistic  applica¬ 
tion  of  the  above  approach  is  the  size  of 
the  matrices  involved.  This  problem  is 
not  completely  hopeless,  however,  for  the 
following  reasons.  Firstly,  the  matrices 
which  occur  are  banded  with  bandwidth 
equal  to  the  horizontal  width  of  the  fil¬ 
ter  array.  Thus  for  reasonably-sized  fil¬ 
ters,  the  matrices  are  both  sparse  and 
structured.  Secondly,  one  does  not  nor¬ 
mally  want  to  design  filters  which  are 
wildly  shift-varying;  it  is  usually  de- 
sireable  to  have  a  filter  which  is  approx¬ 
imately  shift-invariant  far  from  the 
boundaries.  In  this  connection  the  change 
from  clrculant  to  Toeplitz  which  occurred 
in  the  previous  section  (as  a  result  of 
inserting  boundary  conditions)  comes  to 
mind,  and  suggests  that  "displacement  rartv’ 
ideas  ^  may  have  some  relevance  here. 

Another  possibility  is  to  replace  the 
shift  in  the  horizontal  direction  with 
some  other  "basic  dymanlcal  operator"  and 
to  design  in  terms  of  this  operator. 

The  most  obvious  choice  is  a  descrete 
approximation  to  the  derivative,  which 
has  the  following  advantages: 

1)  It  is  local,  and  so  can  be  calculat¬ 
ed  efficiently. 

2)  At  the  boundaries,  it  can  be  calcu¬ 
lated  as  a  one-sided  derivative,  and 
so  avoids  the  problem  of  boundary 
values . 

The  simplest  operator  which  is  a  discrete 
approximation  to  the  derivative  and  en¬ 
joys  both  of  these  advantages  is  the 
matrix 

f-i.  -  o.  0 . *T 


o  -i,  i] 
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where  a2>  a^,...  are  arbitrary  but  fixed. 

The  algebra  generated  by  T,  the  constants, 
and  the  vertical  shift  is  then  commutative, 
and  so  offers  the  possibility  of  a  tract¬ 
able  theory.  The  major  remaining  question 
concerns  the  choice  of  the  constants  aj, 
a, .  . .  While  this  is  currently  under  inves¬ 
tigation,  and  no  definitive  results  have 
been  established,  the  following  points  may 
be  noted. 

1) If  one  designs  a  filter  in  terms  of  the 
derivative  in  the  horizontal  direction 
and  the  shift  in  the  vertical  direction 
one  can  then  replace  the  derivative  by 
the  discretized  version  T. 

2) Since  the  spectrum  of  the  derivative  is 
the  entire  imaginary  axis,  while  the 
spectrum  of  T  can  be  adjusted  by  vary¬ 
ing  the  a^,  this  approach  yields  the 
possibility  of  choosing  the  approximation 
so  that  the  discretized  filter  is  stable 
even  when  the  original  designed  filter 
is  unstable,  for  example  by  concentrat¬ 
ing  the  spectrum  of  T  on  a  few  points. 

To  be  specific,  if  the  denominator  of 

n  . 

the  original  filter  is  £  f,(D)ZJ, 

J=o  J 

where  D  is  the  derivative  in  the  hori¬ 
zontal  direction,  and  Z  the  shift  in 
the  vertical  direction,  the  stability 
condition  for  the  discretized  filter 
will  be 

I  f,(*<  )ZJ  4  0  ,  V  1 2|  s  1  (7) 

J*o  J  * 

for  all  eigenvalues  x,  of  T.  Thus  the 
smaller  the  set  of  eigenvalues,  the 
less  stringent  the  stability  conditions. 

3) For  practical  stability,  if  the  eigen¬ 
value  x^  is  associated  with  a  large 
Jordan  block,  the  theoretical  condition 
(7)  must  be  replaced  by 

•"l 

l  f,(w)Z-  +  0,  VIZI  *  1, 

J«o  J 

for  all  w  such  that  |w  -  x,|  $  1. 

Thus  using  large  Jordan  blocks  can  act¬ 
ually  make  the  stability  conditions 


more,  rather  than  less,  stringent.  We 
note  in  passing  that  T  always  contains 
a  two-dimensional  Jordan  block  with 
zero  eigenvalue. 

4)The  previous  two  paragraphs  taken  to¬ 
gether  imply  that  the.  minimal  polynomi¬ 
al  should  be  of  low  order.  This  has  the 
further  benefit  of  ensuring  that  the 
support  of  the  filter  in  the  horizontal 
direction  is  of  limited  width;  achieving 
this  is  a  major  problem  in  the  shift- 
invariant  design  approach. 

6.  CONCLUSIONS 

We  have  pointed  out  some  theoretical  in¬ 
consistencies  between  the  assumption  of 
two-dimensional  shift-invariance  and  the 
usual  scanning  model  employed  in  the  pro¬ 
cessing  of  two-dimensional  data.  As  a 
remedy  for  this  inconsistency ,  a  one-dlmen- 
sional  periodically  time-varying  model  has 
been  proposed;  this  also  has  the  advantage 
of  not  requiring  boundary  conditions.  Fin¬ 
ally,  two  possible  design  approaches  have 
been  mentioned. 
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LUMPED-DISTRIBUTED  NETWORKS  AND  DIFFERENTIAL-DELAY  SYSTEMS 
J.  Murray 

1.  INTRODUCTION.  In  this  paper  we  will  consider  some  known 
properties  -of  differential-delay  systems  and  their  relationship 
to  the  lumped-distributed  networks  studied  by  classical  circuit 
theorists.  The  two  theories  are  fundamentally  the  same,  but  the 
emphasis  is  different;  in  particular,  the  first  question  asked 
in  the  system-theoretic  approach  tends  to  be  about  stability, 
while  the  circuit-theorists'  primary  Interests  have  had  to  do 
with  passivity.  A  further  major  difference  is  that  the  system- 
I'-eorists  are  concerned  with  the  (infinite-dimensional)  state 
spaces  associated  with  these  systems,  while  the  circuit-theorists 
tend  to  ignore  the  state  space,  and  concentrate  entirely  on 
inpjt-output  properties. 

Actually,  the  similarities  and  differences  between  these  two 
fields  (and  others)  have  been  treated  recently  (and  excellently) 
by  Kamen  [1],  The  present  paper  gives  a  different  viewpoint, 
however,  being  an  analytic  approach  in  contrast  to  the  algebraic 
approach  in  [1],  Further,  it  considers  only  input-output  proper¬ 
ties  of  systems,  and  may  be  considered  as  a  study  of  the  simplest 
case  of  the  convolution  algebra  approach  in  [2,3].  It  is  hoped 
that  the  following  discussion  will  give  some  intuition  for  the 
last-mentioned  approach,  and  in  particular  for  the  relationship 
between  it  and  the  classical  circuit-theorists’  use  of  several 
complex  variables  to  model  lumped-distributed  networks. 

2.  DIFFERENTIAL-DELAY  SYSTEMS:  ALGEBRAIC  ASPECTS.  The  sub¬ 
ject  matter  of  this  section  has  been  extensively  treated  in  many 
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places;  we  give  a  quick  summary  simply  to  fix  ideas.  We  will 
work  with  the  simplest  possible  case  of  differential-delay  sys¬ 
tems,  namely,  the  case  where  all  the  delays  are  integral  multi¬ 
ples  of  one  fundamental  delay;  time-units  are  assumed  to  be 
normalized  so  that  this  fundamental  delay  is  of  length  one.  The 
input-output  mapping  for  such  a  system  (assuming  BIBO  stability) 
is  given  by  convolution  with  an  expression  of  the  form 

H(t)  -  F(t)  +  Z  K-  6(t-n)  (1) 

i*— 

where  F(t)  is  a  matrix-valued  function  in  L,  (-•,»)  and  the 

K.  are  matrices  with  21  IM  <  «•;  this  is  the  simplest  case 

1—  1 

of  the  algebras  studied  by  Callier  and  Oesoer  [2,3]. 

For  conceptual  purposes,  we  have  included  non-causal  systems 
in  the  above;  in  the  real-world  case  of  causal  systems,  we  have 

F(t)  -  0,  t  <  0 
and 

1C,  *  0,  i  <  0  . 

For  a  system  composed  of  a  finite  number  of  differentiators 
and  integer  delays,  the  Fourier  transform  of  a  transfer  operator 
of  the  above  type  is  well  known  to  be  of  the  form 

R(s,e's)  (s  *  iw) 

where  R  is  a  rational  matrix  function  of  two  complex  variables. 
Since  the  functions  s  and  e's  are  algebraically  independent, 
R  is  unique.  Also,  since  the  transform  of  a  composition  (con¬ 
volution)  is  the  product  of  the  transforms,  it  follows  that  an 
algebra  of  input-output  operators  of  the  type  (1)  arising  from 
finite  systems  is  isomorphic  to  a  subalgebra  of  the  field  of 
rational  functions  in  two  variables  (over  B  or  I,  as  appro¬ 
priate).  The  realization  problem  for  differential -del ay  systems 
consists  of  identifying  this  subalgebra,  and  has  been  treated  in 
several  places  (e.g.  [4]).  Since  it  is  not  our  purpose  here  to 
treat  either  the  algebraic  aspects  of  these  systems  or  the 
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problems  of  their  realization,  we  merely  repeat  that  the  alge¬ 
braic  treatment  rests  on  the  fact  that  there  is  an  algebra  iso¬ 
morphism  between  the  appropriate  set  of  input-output  operators 
and  an  algebra  of  rational  functions  in  two  variables.  Thus  one 
may  say  that  from  the  algebraic  point  of  view,  systems  of  the 
type  (1)  can  be  treated  as  two-dimensional. 

3.  DIFFERENTIAL-DELAY  SYSTEMS:  ANALYTIC  ASPECTS.  Much  of 
the  power  of  transform  methods  in  electrical  engineering  arises 
not  simply  from  the  algebraic  isomorphism  between  a  convolution 
algebra  and  a  function  algebra  which  these  transforms  define, 
but  from  the  more  "analytic"  properties  of  the  isomorphism;  e.g., 
the  relationship  between  pole-location  and  stability,  or  between 
passivity  and  the  positive-real  property.  Both  of  these  rela¬ 
tionships  will  be  discussed  below  as  they  apply  to  the  present 
class  of  operators.  The  most  obvious  analytic  property  of  these 
operators  is  that  they  have  a  norm  defined  by 

|H|  -  y  I  F(t)  |dt  +  £  |K.|  .  (2) 

For  convenience,  we  will  restrict  ourselves  to  scaler-input, 
scalar-output  systems  from  here  on;  this  case  contains  all  the 
essential  features  which  we  wish  to  discuss.  With  this  assump¬ 
tion,  it  is  easy  to  check  that  the  operators  of  the  form  (1), 
with  the  norm  (2),  form  a  commutative  Banach  algebra  which  we 
will  denote  by  8.  It  is  therefore  natural  to  try  to  comoute 
the  Gelfand  spectrum  of  this  algebra,  and  see  if  it  can  contri¬ 
bute  to  the  understanding  of  these  operators.  This  can  be  done 
in  various  ways,  but  one  of  the  most  natural  is  to  begin  with 
the  spectrum  of  L^ (-*,»),  and  investigate  its  behaviour  under 
the  transformation  discussed  in  the  previous  section,  which 
changed  the  original  one-dimensional  problem  into  a  two- 
dimensional  problem.  This  transformation  is  given  by  the  map¬ 
ping 

f:  <t  -  C2 
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defined  by 

f(s)  •  (s,e*s)  . 


In  order  to  avoid  working  with  points  at  infinity,  it  is  conven¬ 
ient  to  take  a  bilinear  transform  of  the  first  coordinate,  and 
work  instead  with  the  mapping 

f:  t  -  t2 


defined  by 


This  has  the  advantage  that  f  maps  the  right  half-plane 
into  the  unit  bidisk,  U2,  and  maps  the  imaginary  axis  (which 
is  the  spectrum  of  l.  (—,«•))  into  the  distinguished  boundary 
T2,  of  U2.  (He  will  use  the  notation 


and 


U  *  {Z  €  c||Z|  <  1} 

0  «  {Z  e  t||Z|  s  i> 

T  -  (Z  €  |||Z|  •  1) 

U2  «  U  X  U,  etc.  .  .  ) 


Since  3  has  an  identity,  its  spectrum  is  compact,  and  it  is 
natural  to  conjerture  that  this  spectrum,  o(S),  is  the  closure 
of  the  image  of  the  imaginary  axis.  This  is  in  fact  the  case 
[S].  The  image  is  defined  by 


w  e  * 


Representing  the  torus  as  a  square  with  its  opposite  edges 
identified  in  the  usual  way,  we  can  draw  an  approximation  to 
c(8)  as  in  Fig.  1.  It  consists  of  the  circle  8j  *  i  together 
with  a  line  which  is  asymptotic  to  this  circle. 

As  mentioned  in  the  introduction,  one  of  the  most  important 
Questions  in  system  theory  is  stability.  In  virtually  every 
situation  ir.  wnich  it  arises,  input-output  stability  is  equiva¬ 
lent  to  some  operator  having  a  bounded,  causal  inverse.  It  is 
ez r  this  reason  that  transforms  are  useful;  invertibility  of  an 
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element  in  a  Banach  algebra  is  equivalent  to  invertibility  of  its 
(Gelfand)  transform  at  every  point  in  the  spectrum.  However, 
the  spectrum  in  Fig.  1  is  that  of  8  and  so  nonvanishing  on  this 
set  implies  only  the  existence  of  an  inverse— not  necessarily  a 
causal  inverse.  There  are  two  equivalent  ways  of  deciding 
whether  or  not  a  causal  inverse  exists;  the  first  ("Hurwitz") 
approach  is  to  find  the  spectrum,  S,  of  the  causal  subalgebra 
of  8,  and  check  for  nonvanishing  on  this  set.  (The  second 
C'Nyquist")  approach  will  be  discussed  in  the  next  section.) 

Exactly  as  in  the  case  of  the  spectrum  of  8,  one  is  led  to 
conjecture  that  S  (the  causal  spectrum)  is  the  closure  in  0 
of  the  image  of  the  half-plane  (the  spectrum  of  [(),•))  under 
f.  Again,  this  is  the  case.  While  we  can  no  longer  draw  a  pic¬ 
ture  of  this  spectrum,  we  can  get  a  good  idea  of  what  kind  of 

2 

object  it  is.  Its  intersection  with  U  is  the  image  of  a  one- 

(complex)  dimensional  manifold  under  a  proper  holomorphic  map, 

2 

and  so  is  a  two-dimensional  analytic  subset  of  U  .  The  inter- 

2 

section  of  S  with  the  boundary  of  U  consists  of  two  parts; 
the  spectrum  of  8  described  in  the  previous  section,  and  the' 
disk 

{(1,Z2)||Z2I  <  1}  . 

The  upshot  of  all  this  is  that  the  spectrum  is  a  very  small 
_2 

subset  of  U  .  While  nonvanishing  of  a  function  on  a  one¬ 
dimensional  analytic  subset  of  UZ  together  with  nonvanishing 

2  -2 
on  T  can  imply  nonvanishing  on  all  of  U  [6,7],  o(S)  is 

2 

much  too  small  a  subset  of  T  for  any  such  conclusion  to  be 
possible  in  this  case.  Thus  we  are  led  to  the  conclusion  that 
stability  of  a  system  is  equivalent  to  the  nonvanishing  of  a 
two-variable  rational  function  R(Z^,Z2)  on  the  fairly  compli¬ 
cated  one -dimensional  subset,  S,  of  1^;  or  equivalently, 
to  the  transcendental  function 


being  bounded  away  from  zero  in  the  half-plane.  In  either 
case,  from  the  analytic  point  of  view,  one  has  a  strictly 
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one-dimensional  problem.  The  two-variable  approach  appears 
merely  as  a  device  which  gives  one  a  convenient  way  of  calculat¬ 
ing  the  spectnm  of  the  appropriate  convolution  algebra. 

4.  DIFFERENTIAL-DELAY  SYSTEMS:  A  TOPOLOGICAL  ASPECT.  We 
digress  from  the  main  purpose  of  the  paper  in  this  section  in 
order  to  discuss  the  "Nyquist"  approach  to  stability  mentioned 
above.  The  essence  of  this  approach  is  that  instead  of  looking 
for  nonvanishing  of  the  transform  on  a  spectrum  bigger  than  o(B) 
one  looks  for  conditions  (in  addition  to  nonvanishing)  on  the 
behavior  of  the  transform  on  o(8)  itself.  The  classic  case  of 
this  is,  of  course,  the  Nyquist  criterion  Itself,  where  one 
demands  that  the  transform  in  question  does  not  vanish  on  the 
imaginary  axis,  and  in  addition  that  the  image  of  the  imaginary 
axis  under  the  transform  does  not  encircle  0.  In  other  words, 
one  associates  an  index  with  the  operator  in  question  and,  assum¬ 
ing  that  the  operator  is  invertible,  demands  that  the  index  be 
zero  for  causal  invert! bility.  (We  are  assuming  here  that  the 
original  operator  is  itself  a  bounded,  causal  operator).  In- the 
case  of  the  Nyquist  criterion  itself,  the  index  is  an  integer, 
but  one  can  not  expect  this  to  be  true  in  general.  The  most  one 
can  expect  is  that  the  index  will  take  its  values  in  seme  (pos¬ 
sibly  partially  ordered)  group.  Since  it  is  known  (for  fairly 
general  convolution  algebras  over  H)  that  a  causal  invertible 
element  a  has  a  causal  inverse  if  and  only  if  a  is  in  the 
connected  component  of  the  identity  in  the  group  of  invertibles 
in  the  algebra  (see  [8]),  the  appropriate  "Index  Group"  here  is 
the  quotient  group:  Inverti bles/component  of  the  identity. 

(For  an  arbitrary  3anach  algebra,  this  group  is  actually  known 
as  the  abstract  index  group  of  the  algebra  [9]). 

This  would  be  of  little  use  were  it  not  for  the  fact  that  the 
structure  of  this  grouo  is  known  for  conmutative  Banach  algebras; 
for  such  an  algebra,  the  abstract  index  group  is  given  by 

V  ] 

H  (spectrum,  TL) 

the  first  Cech  cohomology  group  of  the  spectrum  of  the  algebra. 
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simpler  proof  using  the  cascade  loading  formula  is  given  in  [13], 

there  appears  to  be  little  point  in  making  the  above  discussion 

rigorous.  The  important  point  is  that  whereas  stability  Imposes 

restrictions  on  the  behavior  of  the  two-variable  transform  only 
2  2 

on  thin  subsets  of  T  and  U  ,  passive  synthesis  imposes  con¬ 
straints  over  the  entirety  of  both  sets.  From  any  point  of  view, 
the  passive  synthesis  problem  is  two-dimensional. 

6.  STABILITY  FOR  VARIABLE  DELAYS.  It  is  clear  from  the  pre¬ 
vious  section  that  if  one  considers  the  stability  of  a  system  for 
all  lengths  of  delay  time,  one  will  have  a  two-dimensional  prob¬ 
lem.  Various  results  can  be  derived  using  this  approach  [14]. 

As  an  example  we  have: 

PROPOSITION:  Suppose  a  system  R  is  composed  of  a  finite 
number  of  differentiators  and  delays  of  equal  length  a,  so  that 
its  tuo-vcriabl*  transfer  function  is  rational: 

P(Z,,Z2) 

R<2rV  -  raja? 

2 

Assume  that  R  has  no  indstsrminacies  on  T  .  If  the  following 
conditions  are  satisfied: 

i )  There  exists  a  number  M  such  that  R  is  stable  for 
ail  a  >  M 

ii)  Q(1,Z2)  f  0.  |z2]  <  | 

iii)  R  is  stable  for  a  ■  0, 
then  R  is  stable  for  all  a  >  0. 

Proof . 

u  c(8)  v  {{ 1 ,e’9) |0  <  6  <  T}  *  T2 
a>M  a 

2 

so  that  the  hypotheses  imply  that  0(Z^,Z2)  has  no  zeros  on  T  .. 
Condition  iii)  implies  that  0(Z^,1)  f  0  for  |Z^|  <1.  and 
this  together  with  condition  ii)  implies  that  0(2^  ,Z2)  f  0  for 
;z1 1  <  1,  1Z2|  <  1  [6,7].  It  follows  irmediately  that  R  is 

stable  for  all  a  >  0. 

Condition  iii)  is  actually  unnecessary  here;  it  can  be  elimi¬ 
nated  by  a  slightly  more  sophisticated  argument. 
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with  integer  coefficients.  It  is  intuitively  dear  (and  not  dif¬ 
ficult  to  prove)  that 

(o(8)  ,2)  . 

Thus  in  the  present  situation,  stability  requires  that  two 
distinct  integer  indices  be  zero,  rather  than  one  as  in  the  clas¬ 
sical  Nyquist  criterion.  This  is  not  surprising— intuitively  one 
might  expect  to  have  one  index  for  each  "independent"  kind  of 
delay  element.  While  one  can  get  some  feeling  for  these  indices 

VI 

by  examining  the  generators  of  H  (o(8) ,  7Z) ,  it  must  be  admitted 
that  this  is  by  no  means  the  best  way  of  actually  calculating  the 
indices  for  any  given  element  of  8.  In  fact,  the  problem  of 
calculating  these  indices  for  elements  of  algebras  considerably 
more  general  than  8  has  already  been  solved  [10,11]  in  a  much 
more  straight-forward  manner  than  that  discussed  above. 

However,  the  above  discussion  does  bring  out  a  numbei  of 
points.  The  major  one  is  that  the  stability  of  lumped-distrib¬ 
uted  systems  is  associated  with  the  algebraic  topology  of  the 
spectrum  of  the  appropriate  Banach  algebra.  Further,  the  impor¬ 
tant  topological  entity  is  the  first  £ech  cohomology  group— and 
we  note  that  in  the  case  of  c(5)  this  is  not  the  same  as,  e.g., 
the  first  singular  cohomology  group  (in  contrast  with  the  classi¬ 
cal  case).  Thirdly,  one  can  tell  simply  by  looking  at  the  spec¬ 
trum  what  kinds  of  conditions  are  needed  for  stability— in  this 
case,  that  two  integers  vanish.  Finally,  since  H  (o(8 ),Z)  * 
u}  ,  one  car  say  in  some  vague  sense  that,  from  the  topological 
or  index  point  of  view,  the  stability  problem  in  8  is  two- 
dimensional  . 

5.  LUMPED-DISTRIBUTED  CIRCUITS:  PASSIVE  SYNTHESIS.  As  was 
mentioned  in  the  introduction,  the  treatment  of  circuits  consist¬ 
ing  of  lumped  elements  and  commensurable  transmission  lines  is' 
formally  similar  to  that  of  differential-delay  systems.  There 
are  two  major  differences  however;  in  the  first  olace,  the  func¬ 
tions  involved  are  input  parameters  (impedance,  admittance,  or 
scattering)  rather  than  transfer  functions  (we  will  again  confine 


.ft  >. 


216 


J.  MURRAY 


our  attention  to  the  scalar  case):  secondly,  the  really  inter¬ 
esting  problem  is  passive  synthesis. 

The  purpose  of  this  section  is  to  give  some  feeling  for  the 
fact  that,  in  contrast  with  the  stability  problem,  the  problem 
of  passive  synthesis  is  genuinely  two-dimensional. 

To  this  end,  suppose  that  we  are  given  a  two-variable  rational 
function  R(Z.j,Z2)  and  that  we  wish  To  synthesize  a  passive  one- 
port  whose  scattering  parameter  is 

p(s)  «  R(|^|  ,  e“s)  . 

Exactly  as  in  the  classical  case,  it  is  necessary  that  p  be 
bounded  real  on  the  right  half-plane  or,  equivalently,  that  R 
be  bounded  real  on  the  set  S  discussed  in  section  3.  However, 
if  a  circuit  devoid  of  sources  is  constructed  from  lumped  com¬ 
ponents  and  unit  delay  lines,  it  remains  passive  when  delay  lines 
of  any  nonnegative  length  replace  the  unit  delays.  It  follows 
that  the  function 

p„(S)  ■  .  «*“> 

is  bounded  real  on  the  right  half-plane  for  all  a  >  0  or  equi¬ 
valently  (in  an  obvious  notation)  that  R  is  bounded  real  on 

the  set  S  ,  3  >  0.  While  it  is  not  true  that  U  like  this 
a  a  cl 

fills  out  the  bidisk  (U  S  is  three-dimensional,  while  the 

C  CL 

bidisk  is  four-dimensional ) ,  it  is  true  that  as  a  varies  over 

the  nonneoative  real  numbers,  the  set  0(8  )  drawn  in  Fig.  1 

0  Z 

moves  in  such  a  way  as  to  sweep  out  all  of  T  ,  with  the  excep¬ 
tion  of  the  set  (labeled  C  in  Fig.  1): 

C  *  {(91,e2)|91  «  0,  e2  1  0}  . 

Considering  the  known  results  about  the  stability  of  two- 
variable  systems  in  terms  of  their  behavior  on  T  [6,7],  and 
of  their  behavior  on  three-dimensional  subsets  of  U  [12]  it 
is  then  quite  plausible  that  a  necessary  condition  for  passive 
synthesis  is  that  R( Z1 .Z^)  be  bounded  real  on  all  of  U2. 

This  in  fact  is  well  known  to  be  the  case,  and  can  be  proved  by 
considerations  along  the  above  lines.  However,  since  a  much 


21  7 


0.  HURRAY 


7.  SYSTEMS  WITH  IRRATIONALLY  RELATED  DELAYS.  We  have  been 
concerned  throughout  with  systems  whose  delays  are  all  integer 
multiples  of  one  fundamental  delay.  If  there  are  n  independent 
delays  over  the  rationals.  then  one  gets,  in  the  usual  way, 
rational  functions  in  n+1  variables.  We  remark  in  passing 
that  if  one  has  two  independent  delays  without  any  differentia¬ 
tions  (lumped  elements)  one  will  again  get  rational  functions  in 
two  variables.  However,  the  theory  here  is  very  different  from 
the  lumped-distributed  case  discussed  previously,  since  now  the 
spectrum  is  all  of  T -,  and  the  causal  spectrum  is  all  of  0  . 
For  this  reason,  the  stability  theory  of  pure-delay  systems  with 
several  incoetnensurable  delays  is  much  simpler  than  that  of  dif¬ 
ferential-delay  systems. 

In  the  general  case,  we  will  merely  indicate  what  the  spectrum 
looks  like.  For  systems  Involving  n  independent  delays  and 
differentiations,  o{8)  consists  of  a  line  together  with  an  n- 
dimensional  torus,  the  line  being  asymptotic  to  a  dense  line 
(i.e.,  one  which  winds  around  the  torus  at  an  irrational  angle) 
on  the  torus. 

The  set  S  (the  causal  spectrum)  consists  of  o(8)  as 
described  above  together  with  one-dimensional  analytic  set  in 
U  ,  and  an  n-dimensional  polydisc  (whose  distinguished  bound¬ 
ary  is  the  n-dimensional  torus  mentioned  above)  contained  in 
the  boundary  of  Un+1 .  Again,  the  spectrum  of  the  convolution 
algebra,  o(8),  can  be  found  by  examining  the  image  of  the 
imaginary  axis  under  the  mapping  which  transforms  the  one¬ 
dimensional  problem  into  an  (n+1 )-dimensional  problem;  that  is, 
the  several-variable  aporoach  may  be  regarded  as  a  vehicle  for 
computing  the  spectrum. 

Finally,  the  "stability  index  grouo"  is  isomorphic  to  2Zn+1; 
a  more  detailed  analysis  shows  that  when  one  allows  arbitrarily 
many  delays  of  arbitrary  lengths,  this  group  is  isomorphic  to 
(see  [10,11]). 
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8.  CONCLUSIONS.  Our  main  purpose  here  has  been  to  give  a 
discussion  of  the  radical  differences  which  can  occur  between 
algebras  of  systems  which  are  formally  similar.  These  differ¬ 
ences  may  arise  from  the  problems  under  consideration,  as  in  the 
difference  between  the  problems  of  stability  and  passive  syn¬ 
thesis;  or  from  differences  in  the  classes  of  systems  themselves 
as  in  the  stability  problem  for  systems  composed  of  differentia¬ 
tors  and  integral  multiples  of  one  delay  compared  to  the  same 
problem  for  systems  involving  two  irrationally  related  delays. 

A  further  objective  has  been  to  demonstrate  the  utility  of 
the  Gelfand  spectrum  in  connection  with  these  problems.  In  the 
classical  cases  of  either  lumped,  conti nuous-ti me  systems  or 
discrete-time  systems  the  Gelfand  spectrum  is  a  circle,  (or  a 
disk  in  the  causal  case),  and  the  usefulness  of  the  representa¬ 
tion  of  elements  of  the  system  as  functions  from  this  set  into 
I  is  well  known.  In  mixed  differential-delay  systems,  the 
spectra  are  considerably  more  complicated  objects,  and  the  intui 
tion  to  be  gained  by  studying  the  action  of  individual  systems 
on  these  spectra  is  not  quite  so  transparent.  Nonetheless,  it 
is  hoped  that  the  above  discussion  has  shown  that  even  in  the 
nonclassical  cases  the  spectrum  is  significant,  and  that  know¬ 
ledge  of  the  spectrum  of  the  algebras  involved  can  give  a  deeper 
insight  into  the  behavior  of  the  systems. 
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Abstract 


A  method  is  described  for  the  design  of  two-dimensional  half -plane 
recursive  digital  filters,  in  the  form  of  a  cascade  connection  of  filters 
which  are  of  second  order  in  the  (principal)  direction  of  recursion,  and 
of  arbitrarily  high  order  in  the  other  direction.  The  filters  thus  derived 
are  shown  to  be  automatically  stable,  but  yield  poor  responses  in  the 
vicinity  of  very  wide  or  very  narrow  bandwidths.  Some  techniques  for 
tackling  these  difficulties  are  discussed,  and  the  results  of  applying 
these  design  procedures  are  shown. 
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THE  OESIGN  OF  2-D  FILTERS  AS  1-D  TIME-VARYING  SYSTEMS 


Abstract  '■ 

In  a  previous  paper  a  theoretical  approach  to  two-dimensional  recursive 
digital  filtering  was  proposed.  This  approach  was  based  on  the  fact  that 
two-dimensional  recursive  filtering  with  the  normal  "scanning"  recursion 
is  in  reality  a  one  (time)  -  dimensional  periodically  time-varying  discrete¬ 
time  operation.  Time-varying  systems  are  normally  intractable;  periodically 
time-varying  discrete-time  systems,  however,  are  among  the  few  classes  of 
time-varying  systems  for  which  a  complete,  closed-form  theory  exists.  This 
theory  transforms  single-input  single-output  time-varying  systems  of  period 
N  (where  we  assume  that  the  sampling  period  is  1)  into  a  subclass  of  N-input, 
N-output  time-invariant  systems.  There  is  a  possibility  of  using  classical 
multi -input  multi -output  time-invariant  system  theory  to  design  2-D  filters 
in  this  setting. 

There  are  two  problems  which  arise.  Firstly,  N  is  usually  very  large, 
and  so  direct  manipulations  are  impossible.  Secondly,  the  time-varying  sys¬ 
tems  correspond  to  only  a  subset  of  multi -input  multi -output  systems,  and 
the  classical  design  methods  do  not  necessarily  yield  a  design  in  this 
subset. 

This  paper  exhibits  a  desiqn  procedure  which  takes  care  of  both  of 
these  problems,  and  shows  examples  of  filters  designed  by  using  this 
procedure. 
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6.  Summary 

Although  one  might,  upon  a  cursory  investigation,  conclude  that  the  image 
processing  problem  was  simply  a  two  dimensional  generalization  of  the  standard 
1-D  signal  processing  problem  this  is  not  the  case  since  both  the  signal  and 
noiAe  phe.nome.na.  encountered  -in  the  image  proceAAing  problem  tend  to  be  either 
nonlinear  and/or  Apace  variant,  (the  spatial  analog  of  time-varying).  Indeed, 
both  photo-electric  Ahot  noiAe  and  ^ilm  grain  noiAe  are  highly  signal  dependent 
in  nature  while  the  edge  ejects  in  a  finite  image  introduce  space-variant 
effects.  Although  a  theory  for  coping  with  these  nonlinear  and  space-variant 
effects  has  been  developed  in  a  1-D  setting,  in  the  2-D  image  processing  problem 
these  techniques  have  proven  to  be  computationally  prohibitive.  As  such,  the 
present  work  unit  is  directed  towards  the  problem  of  developing  computationally 
viable  algorithmA  £or  the  digital  image  proceAAing  problem.  These  include 
Aub-optimal  algorithmA  for  estimation  in  signal -dependent  noise,  analytic 
techniques  for  reducing  the  elective  dimenAionality  of  an  image  and  techniques 
for  exploiting  the  Aignal  dependent  nature  of  the  noiAe  phenomena. 
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Estimation  in  signal-dependent  film-grain  noise 


Gary  K.  Froehlich,  John  F.  Walkup,  and  Thomas  F.  Krile 


Optimal  Mtiaators  ire  derived  for  t  signal -dependent  film-grain  noiae  model,  and  the  effect  of  signal-de¬ 
pendence  on  the  estioaton's  structures  is  investigated.  Due  to  the  mathematical  complexity  of  these  opti¬ 
mal  estimators,  various  suboptimal  estimators  are  proposed.  Computer  emulations  ere  then  presented 
which  compare  the  optimal  and  suboptimal  estimators  with  regard  to  mean  square  estimation  error,  sensitiv. 
ity  to  signal-dependence,  and  robustness  with  respect  to  the  a  priori  signal  probability  density  function. 


1.  Introduction 

A  number  of  physical  noise  processes  are  inherently 
signal-dependent  These  include  photoelectronic  shot 
noise,1  magnetic  tape  recording  noise,2  and  of  course 
photographic  film-grain  noise.  ^  In  an  earlier  paper,4 
a  very  general  model  was  presented  which  embodies  all 
the  just  mentioned  processes  and  more.  That  model 
is  presented  again  in 

r  *  s  +  kfU)n\  +  nj,  (1) 

where  r  is  the  noisy  measurement;  s  is  the  underlying 
signal  to  be  estimated,  which  is  generally  characterized 
by  its  probability  density  function  p(s);  /(s)  is  a  zero- 
memory  (spatial)  function  of  the  signal;  n\  and  n2  are 
signal-independent  random  noise  processes;  and  k  is  a 
scalar  constant  which,  when  set  equal  to  zero,  yields  the 
signal-independent  additive  noise  modeL  It  is  further 
assumed  that  ni,  n2,  and  s  are  mutually  statistically 
independent. 

It  has  already  been  shown  that  estimators  which  ig¬ 
nore  signal-dependence  of  noise  pay  a  penatey  in  terms 
of  mean  square  estimation  error  (MSEE)  and  that  in¬ 
clusion  of  the  signal-dependence,  while  increasing  the 
complexity  of  estimators,  results  in  potentially  superior 
performances.4-5  In  this  paper,  we  intend  to  compare 
various  estimators  with  regard  to  ease  of  implementa- 
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tion,  MSEE,  sensitivity  to  signal -dependence,  and  ro¬ 
bustness  with  respect  to  the  a  priori  probability  density 
p(s).  This  is  achieved  through  a  comparison  of  the 
estimators ’s  structures  and  by  using  computer  simula¬ 
tions. 

To  simplify  the  comparisons  as  much  as  possible, 
various  assumptions  about  the  model  of  Eq.  (1)  are 
necessary.  The  signal-independent  noise  terms,  rtj  and 
n2,  are  assumed  to  be  zero-mean  normal  random  vari¬ 
ables  with  variances  of  and  of,  respectively.  Nota- 
tionally,  this  is  represented  by 

<*i  -  N(0,<7?).  ;  ■  1,2.  (2) 

Furthermore,  a  specific  function  for  f{$)  must  be  chosen. 
The  function  f(s)  =  sp  is  of  particular  interest,  as  it 
represents  photographic  film-grain  noise  when  s  rep¬ 
resents  photographic  density,  p  is  between  0.2  and  0.7, 14 
and  k  is  a  scanning  constant  The  exponent  p  is  taken 
to  be  0.5  in  the  remainder  of  this  paper.  Thus 

/(*)  -  vT  (3) 

The  final  assumption  is  with  regard  to  the  a  priori 
probability  density  function  (pdf)  of  the  signal,  i.e.,  pis). 
Several  cases  are  treated  in  Refs.  4  and  5.  These  in¬ 
clude  the  Gaussian,  Rayleigh,  uniform,  discrete-uni¬ 
form,  and  folded-normal6  pdfs,  which  were  chosen  for 
their  tractability  or  their  positivity  constraints.7  For 
purposes  of  comparison  and  brevity,  only  the  Gaussian, 
Rayleigh,  and  uniform  cases  will  be  presented  here. 
With  the  above  assumptions,  the  model  becomes 

r  ■  s  +  k  vTnl  +  n8,  (4) 

and  the  conditional  measurement  r,  given  s,  is  distrib¬ 
uted  normally  with  mean  s  and  variance 

u(«)  ■  (5) 

or,  in  the  notation  of  Eq.  (2), 

p(r|x)  — (6) 
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Fig.  1.  MMSE  estimator  structure  for  Gaussian  pis)-,  solid  Line  ( a\ 
”  0.3,  k2rx  varies);  dashed  line  (fcs<r?  ■  0.3,  a\  varies);  pis)  ~  Ni  1.5, 
0.25). 


II.  Optimal  Estimation — Minimum  Maan  Square  Error 

Optimal  estimators  based  on  the  measurement  model 
of  Eq.  (4)  are  now  presented.  The  first  optimality  cri¬ 
terion  considered  is  minimization  of  MSEE,  or  equiv¬ 
alently,  minimization  of  the  Bayes’s  risk  for  a  quadratic 
cost  function.19  Computationally,  this  is  merely  the 
conditional  mean  of  the  a  posteriori  probability  density 
function  p(s|r).  Thus  the  minimum  mean  square  error 
(MMSE)  estimate  is  given  by 


iMMSE  ”  sp(i|r)ds.  (7) 

Unfortunately,  the  a  posteriori  density  p($|r)  is  very 
difficult  to  compute.  Bayes’s  rule  can  be  applied, 
however,  to  rewrite  Eq.  (7)  as 


J~  spir\s)pis)ds 

i  MMSE  ■  -  •  (8) 

J*  p(r|*)p(s)dl 

where  p(r|s)  is  given  by  Eq.  (6),  and  some  particular 
form  is  assumed  for  pis).  As  an  example  of  the  com¬ 
plexity  of  this  estimator,  consider  the  computation  of 
Eq.  (8)  when  p(s)  is  Gaussian,  i.e., 

pis)  —  Nip,,aj).  (9) 


In  this  case  p(r|s)p(s)  is  given  by 


p(r|s)p(i) 


ir  -  s)2  (5  - 

,  u<J)  <rf  j. 


(10) 


where  vis)  is  given  in  Eq.  (5).  Similar  complexity  re¬ 
sults  when  pis)  is  assumed  to  be  either  Rayleigh  dis¬ 
tributed  or  uniformly  distributed. 

To  arrive  at  some  general  conclusions  about  the 
MMSE  estimators,  the  structure  of  the  estimators  is 
presented.  The  estimate  'mmse  >s  plotted  vs  the 
measurement  r  in  Figs.  1-3.  In  Fig.  l,p(s)  is  assumed 
to  be  Gaussian,  in  Fig.  2  it  is  assumed  Rayleigh,  and  in 
Fig.  3  it  is  assumed  uniform.  In  each  figure,  the  solid 


lines  represent  cases  in  which  a\,  the  variance  of  the 
signal-independent  noise  term,  is  fixed  while  k2<rh 
which  is  proportional  to  the  variance  of  the  signal- 
dependent  noise  term,  is  allowed  to  vary.  The  dashed 
lines  represent  cases  in  which  k2a\  is  fixed  and  ar%  is  al¬ 
lowed  to  vary.  The  actual  numerical  values  of  all  the 
parameters  are  realistic,  and  they  serve  to  restrict  the 
photographic  density  variables  to  a  practical  range  of 
zero  to  three.10  Also,  inspection  of  Eq.  (4)  shows  that 
when  k  *  0,  the  model  reduces  to  the  classical  signal- 
independent  additive  noise  model  r  *  s  +  i»i. 

Several  general  conclusions  lare  implied  by  exami¬ 
nation  of  Figs.  1-3.  In  all  three  cases,  for  example,  the 
estimators  exhibit  greater  sensitivity  to  the  signal- 
dependent  noise  term  than  to  the  signal-independent 


Fig.  2.  MMSE  estimator  structure  for  Rayleigh  pis):  solid  line  iai 
•  0.3,  (t2of  varies);  dashed  line  (*J<rf  »  0.3,  o\  varies);  p(j)  —  Rayleigh 
with  mean  «  1.25,  c2  “  0.43. 


Fig.  3.  MMSE  estimator  structure  for  uniform  pit):  Solid  line  (<r§ 
■  0.3,  varies);  deshed  line  (*J<rf  »  0.3,  varies);  pis)  —  uniform 
with  mean  ■  1.5.  e2  ■  0.75. 
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Fig.  4.  MAP  estimator  structure  for  Gaussian  pis):  solid  line  (a\ 
*  0.3,  Je=<Ti  varies);  dashed  line  (fe2<rf  »  0.3,  a\  varies);  pis)  —  Nil.5, 
0.25). 


Fig.  5.  MAP  estimator  structure  for  Rayleigh  pis):  solid  line  ia\ 
•  0.3,  k2ef  varies);  dashed  line  »  0.3,  <j  varies);  pisi~  Rayleigh 
with  mean  »  1.25,  c2  «  0.43. 


one.  Also,  in  all  three  cases,  if  either  noise  term  in¬ 
creases  without  bound,  the  estimate  becomes  the  (as¬ 
sumed  known)  mean  of  s.  Furthermore,  note  that  the 
more  pis )  deviates  from  normality,  the  more  nonlinear 
the  estimator  structure  becomes.  This  implies  that 
estimator  performance  will  suffer  if  the  assumed  form 
of  pis)  is  wrong.  Ideally  we  would  like  to  find  a  robust 
estimator,  i.e.,  an  estimator  for  which  the  structure  is 
invariant  to  changes  in  p(s). 

One  final  note  about  the  MMSE  estimator  is  in  order. 
The  complexity  of  Eq.  (8)  would  seem  to  discourage  the 
use  of  this  estimator  on  a  point-by-point  basis  over  an 
image.  However,  if  the  parameters  k,a\,o\,  and  the 


moments  of  s  do  not  change  for  the  class  of  images 
under  study,  the  numerical  integration  of  Eq.  (8)  can  be 
done  off-line  for  the  entire  practical  range  of  measure¬ 
ments  r.  The  only  on-line  operation  then  is  reduced  to 
a  table  look-up  procedure  to  match  the  precalculated 
estimate  with  the  corresponding  measurement. 

III.  Optimal  Estimation — Maximum  a  posteriori 

As  an  alternative  to  minimization  of  mean  square 
error,  we  now  consider  an  estimator  which  minimi*** 
the  Bayes’s  risk8  for  a  uniform  cost  function.  This  turns 
out  to  be  the  conditional  mode  of  the  a  posteriori 
probability  density  function  pisjr).  Since  the  mode  is 
merely  the  peak  value  of  the  pdf,  the  estimator  is  often 
referred  to  as  the  maximum  a  posteriori  (MAP)  esti- 
mator.8,9  The  MAP  estimate  is  treated  in  detail  in  Refs. 
4  and  5.  For  the  model  of  Eq.  (4),  the  MAP  estimate  is 
the  solution  of  a  polynomial.  For  p{s)  normally  dis¬ 
tributed,  the  polynomial  is  cubic.  For  Rayleigh  p(s), 
the  MAP  equation  is  of  degree  four,  and  for  uniform 
pis),  the  polynomial  is  quadratic.  The  latter  case,  with 
uniform  pis),  is  equivalent  to  maximum  likelihood 
(ML)  estimation. 4A8-9  The  ML  estimate  is  used  when 
no  prior  information  about  the  statistics  of  s  is  assumed 
or  known.  The  uniform  distribution  for  pis)  corre¬ 
sponds  to  this  worst  case. 

As  with  the  MMSE  estimator,  the  MAP  estimator 
structures  are  presented  as  plots  of  the  MAP  estimates 
vs  the  measurements  r.  Figure  4  is  the  MAP  estimator 
structure  for  Gaussian  p(s),  Fig.  5  is  the  MAP  estimator 
structure  for  Rayleigh  pis),  and  Fig.  6  is  the  ML  esti¬ 
mate.  Once  again,  the  solid  lines  repru  *nt  the  cases 
wherein  o\  is  fixed  and  k2a{  is  allowed  to  vary,  and  the 
dashed  lines  correspond  to  the  case  where  k2<tf  is  fixed 
and  oi  is  allowed  to  vary. 

The  overall  trends  in  these  three  figures  are  very 
similar  to  those  indicated  in  Figs.  1-3.  Again,  the  es¬ 
timators  become  increasingly  nonlinear  as  pis)  departs 
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from  normality.  Also  the  sensitivity  to  the  signal- 
dependent  noise  term  is  much  greater  than  the  sensi¬ 
tivity'  to  the  signal-independent  noise  term  for  all  three 
cases.  However,  the  value  of  the  estimate  when  k2<7x 
increases  without  bound  is  always  lower  than  the  esti¬ 
mate  when  erf  increases  without  bound.  This  is  quite 
different  from  the  MMSE  estimators  which  converged 
to  the  same  value  regardless  of  which  noise  term  became 
unbounded. 

Another  characteristic  shared  by  both  the  MAP  and 
MMSE  estimators  is  computational  complexity.  It  is 
generally  undesirable  to  have  to  solve  a  polynomial  at 
evepr  pofht  of  measurement  (pixel),  just  as  it  was  un¬ 
desirable  to  integrate  numerically  at  each  point 
However,  if  the  parameters  k,a\,a\,  and  the  statistics 
of  s  do  not  change  appreciably  for  the  class  of  images 
under  study,  the  problem  is  again  reduced  to  one  of  an 
off-line  table  generation  followed  by  on-line  table 
look-ups. 

IV.  Suboptimal  Estimation 

The  major  limitations  of  the  optimal  estimators  just 
presented  were  their  general  computational  complexity 
and  their  sensitivity  to  choice  of  p  (s ).  It  might  prove 
acceptable  to  sacrifice  some  theoretical  performance  for 
ease  of  implementation  and  some  measure  of  robust¬ 
ness.  Toward  this  end,  suboptimal  estimators  are  next 
investigated. 

A.  Weighted  Spatial  Averaging 

As  a  first  step,  the  basic  sample  mean  will  be  ex¬ 
ploited.  The  sample  mean  is  certainly  easy  to  imple¬ 
ment,  and  it  is  also  known  to  be  the  most  robust  esti¬ 
mator  of  the  true  mean.11*14  The  estimation  procedure 
then  is  to  replace  each  measurement  with  the  average 
of  that  measurement  and  its  neighbors.  In  a  2-D  sam¬ 
pled  image,  for  example,  this  might  correspond  to  a 
pixel  and  its  eight  nearest  neighbor  pixels.  Defining  the 
sample  mean  at  the  point  j  as  7j, 
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the  weighted  spatial  average  (WSA)  estimate  is  then 

iwSA;  ”  ?;•  (12) 

Unfortunately,  this  estimator  has  one  rather  severe 
limitation:  it  is  not  robust  with  respect  to  the  spatial 
noise  power  spectrum.  This  occurs  because  the  WSA 
algorithm  is  in  effect  a  Finite-window  spatial  low-pass 
filtering  operation.  As  such,  it  does  not  affect  low- 
frequency  noise,  and  it  destroys  high-frequency  signal 
information. 

B.  Modified  Signal-independent  MAP 

In  an  attempt  to  combine  the  desirable  features  of  the 
WSA  algorithm  and  the  sophistication  of  a  slightly  more 
complex  estimator,  the  MAP  estimator  designed  for 
signid-independent  additive  noise  and  Gai,rsian  signal 
statistics  was  modified.  The  modification  allows  the 
statistics  to  vary  spatially— a  consequence  of  signal- 
dependence.  The  signal-independent  MAP  estimator 
is  given  by4  * 


<r* 


r  +  ■ 


t\ 


(13) 


ai  + 

where  it  was  assumed  that  s  ~N(a,, of).  This  requires 
a  priori  knowledge  of  a*  and  <r„  which  remain  fixed  in 
the  estimator  of  Eq.  (13). 

It  can  be  easily  shown  that  the  expected  value  of  r;  is 
A«,  i-e., 


EVA  *  m»*  (14) 

Thus  we  can  eliminate  the  need  for  a  priori  knowledge 
of  At  and  simultaneously  allow  the  statistics  to  vary 
spatially  by  replacing  At  in  Eq.  (13)  with  an  estimate  of 
At.  namely,  rj.  The  modified  signal-independent  MAP 
(MSIMAP)  estimator  then  becomes 


<ma n.m' 


<ri  +  ir| 


n* 


<r\  . 


(IS) 


where  7 )  was  given  by  Eq.  (11).  Because  the  MSIMAP 
estimator  of  Eq.  (15)  adapts  spatially  over  the  image, 
it  should  out  perform  the  unmodified  MAP  estimator 
of  Eq.  (13)  when  both  are  applied  to  the  signal-depen¬ 
dent  noise  model  of  Eq.  (4). 

A  potential  limitation  of  the  MSIMAP  algorithm  is 
the  requirement  that  of  be  known  a  priori  There  may 
be  situations  where  this  is  not  unreasonable,  such  as 
when  the  energy  of  the  signal  is  known;  however,  there 
may  well  be  situations  when  it  is  quite  unreasonable  to 
require  a  priori  knowledge  of  any  of  the  signal  statistics. 
An  additional  modification  is  required  to  eliminate  this 
defect. 


C.  James-Stein 

Just  as  the  signal  mean  is  adaptively  estimated  in  the 
MSIMAP  algorithm,  the  signal  variance  a2,  can  also  be 
adaptively  estimated  within  the  overall  estimator 
structure.  Note  that  Eq.  (15)  can  be  rewritten  as 


where 


f,  -  Qrj  +  (1  -  Q)Tj, 


Q  *  ff?/(cf  +  a\). 

Equation  (16)  can  in  turn  be  rewritten  as 


(16) 


.17) 


f,  *  ?j  +  Q(ri  ~  rj).  (18) 

Now  the  signal-independent  additive  noise  model  for 
which  the  signal-independent  MAP  estimator  was  de¬ 
signed  has  r  distributed  normally  with  mean  a<  and 
variance  of  given  by 


Thus 


r,  *  of  + 


(19) 


9- 


<r, 


-  1 


li 


(20) 


Rewriting  Q  in  this  fashion  eliminates  the  explicit  re¬ 
quirement  to  know  of.  It  was  assumed  at  the  outset 
that  a\  was  known,  so  the  problem  is  reduced  to  one  of 
estimating  of.  Since  r  is  the  measured  quantity,  the 
obvious  intuitive  estimate  for  o;  is  the  sample  variance, 
employing  the  same  nearest-neighbor  approach  used 
to  compute  the  sample  mean.  Defining  the  sample 
variance  at  location  j  as  u]t  it  is  given  by 


238 


Table  L  Sample  MSEC  for  Gauaalan  pit)  wtm  p.  m  1.5  and  «\  »  0.25 


Cate 

■  0 

4  -  0.03 

*- 

t-a 

«  t 

O  O 

o  o 

*  0.03 

4*0 

Noise 

0.027 

0.014 

0.012 

MMSE 

0.031 

0.011 

0.011 

MAP 

0.024 

0.014 

0.012 

ML 

0.027 

0.014 

0.012 

WSA 

0.163 

0.164 

0.162 

MSIMAP 

0.026 

0.014 

0.012 

JS 

0.026 

0.015 

0.012 

Table  IL  SSmpto  MSEE  lor  RayMflh  pis)  wHh  p. 

*  1.25  and  a\  *  0.43 

-  0 

**df  m  0.01 

*s4  -  0.03 

Com 

4  -  0.03 

4  *  o.oi 

4*0 

Noise 

0.026 

0.012 

0.011 

MMSE 

0.024 

0.012 

0.009 

MAP 

0.025 

0.012 

0.010 

ML 

0.026 

0.012 

0.011 

WSA 

0.297 

0.290 

0.293 

MSIMAP 

0.025 

0.012 

0.011 

JS 

0.025 

0.012 

0.011 

Table  M. 

Sample  MSEE  lor  Olacrete-UnMorm  pis)  with  Moan  «  0.25  and 
Var.  -  0.02 

Case 

*J4  *  o 

4  *  0.03 

*2<n  *  0.01 
ff?  »  0.01 

*s4  -  0.03 

4-0 

Noise 

0.038 

0.011 

0.002 

MMSE 

0.009 

0.007 

0.004 

MAP 

0.015 

0.009 

0.001 

ML 

0.038 

0.011 

0.002 

WSA 

0.015 

0.005 

0.001 

MSIMAP 

0.022 

0.009 

0.002 

JS 

0.027 

0.007 

0.002 

Tibia  IV.  Sample  MSEE  lor  Otacrale-Unllerm  pis)  with  Moan  «  1.S  end 
Var.  a  0.75 


*  0 

a2<4  *  o.oi 

-  0.03 

Case 

4  -  0.03 

4  *  o.oi 

4*0 

Noise 

0.026 

0.014 

0.012 

MMSE 

0.021 

0.014 

0.014 

MAP 

0.023 

0.014 

0.012 

ML 

0.026 

0.014 

0.012 

WSA 

0.021 

0.015 

0.015 

MSIMAP 

0.025 

0.013 

0.012 

JS 

0.016 

0.009 

0.012 

1 

n 

"  rt  -  1 

V^r;  •  • J  )  . 

•  •1 

where  r,  was  defined  by  Eq.  (11). 

With  c;  adaptively  estimated  by  o,,  the  variable  Q 
now  varies  with  location  and  thus  is  subscripted  as  (?,, 
where 

Q,  -  l  -  —  •  (22) 

Vj 

The  final  estimator  resulting  from  these  manipula¬ 
tions  is  given  by 


distribution  and  exhibits  uniformly  lower  MSEE  than 
the  sample  mean.  This  estimator  is  known  as  the 
James -Stein  (JS)  estimator. 15-16  It  can  be  shown16  that 
the  MSEE  is  further  lowered  by  restricting  Q;  to  be 
nonnegative.  With  this  final  modification,  the  (JS) 
estimator  is  given  by 

i,  -  r,  +  QJirj  -  fj),  (24) 

where  a  +  A  max(0,a),  Qj  is  given  by  Eq.  (22),  and  fj  is 
given  by  Eq.  (11). 

An  estimator  which  is  in  some  sense  intermediate 
between  the  MSIMAP  estimator  [Eq.  (15)]  and  the  JS 
estimator  [Eq.  (24)}  can  be  obtained  by  using  the  sample 
variance  Vj  of  Eq.  (21)  to  estimate  the  signal  variance 
a j  rather  than  the  measurement  variance  a?.  This  es¬ 
timator  is  discussed  in  Ref.  5,  where  it  is  called  the 
MSIMAP2  estimator.  It  was  also  discussed  by  Lee,17 
who  showed  some  experimental  results  with  noise- 
degraded  images. 

V.  Simulations 

To  compare  the  optimal  and  suboptimal  estimators 
with  each  other,  it  was  necessary  to  perform  several 
computer  simulations.  Only  a  few  results  are  presented 
in  this  paper.  For  a  very  extensive  tabulation  of  results 
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As  it  turns  out.  this  is  an  empirical  Bayesian  estimator  Fig.  7.  Uncorrupted  signal*  with  discrete-uniform  steustie*:  (») 
which  estimates  the  mean  of  a  multivariate  normal  low-eontr»»i  io». -signal-mean  case;  (b)  high-contrast  case. 
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pendent  and  signal-dependent  noise  (v?  *>  <r|  «  0.01). 


see  Ref.  5.  The  estimators  treated  here  are  as  follows: 
(1)  the  MMSE  estimator  with  p(s )  Gaussian;  (2)  the 
MAP  estimator  with  p(s)  Gaussian;  (3)  the  ML  esti¬ 
mator;  (4)  the  WSA  estimator;  (5)  the  MSIMAP  esti¬ 
mator;  and  (6)  the  JS  estimator. 

These  six  estimators  were  applied  to  measurements 
generated  by  the  noise  model  of  Eq.  (4).  Here  three 
noise  regimes  are  considered.  They  are  (1)  a  signal- 
independent  noise  only  case  [k2aj  =  O.oj  =  0.03).  (2)  a 
signal-dependent  noise  only  case  (<t\  *  0,>?2<rf  =  0.03), 
and  (3)  a  case  with  equal  parts  signal-dependent  and 
signal-independent  noise  (or?  =  k2a\  =  0.01).  The  val¬ 
ues  were  chosen  to  insure  reasonable  SNRs  for  purposes 
of  visual  comparison.  Each  Monte  Carlo  simulation 
employed  256  sample  measurements,  and  200  simula¬ 
tions  were  run  for  each  case. 

The  sample  MSEE  for  each  of  the  six  estimators,  as 
well  as  the  squared  deviation  of  the  noisy  measurement 
from  the  true  signal  (labeled  simply  noise),  is  tabulated 
under  columns  corresponding  to  the  three  noise  mix¬ 
tures  described  above.  There  are  four  such  tables 
presented  in  this  paper.  Table  I  is  for  Gaussian  p(s). 
To  get  some  feeling  about  robustness,  pis)  is  next 
treated  (Table  II)  as  a  Rayleigh  pdf.  To  allow  still 
further  deviation  from  normality,  pis)  is  next  taken  to 
be  a  discrete-uniform  density.  Here,  however,  two  sets 
of  parameters  are  considered.  The  first  corresponds 


to  a  low-contrast  low -signal -mean  case  (Table  III),  while 
the  second  corresponds  to  a  high-contrast  case  (Table 

IV). 

Looking  at  Table  I,  two  things  are  immediately  ap¬ 
parent.  First,  the  MMSE  estimator  does  indeed  exhibit 
minimum  sample  MSEE,  except  in  the  case  where  sig¬ 
nal-independent  noise  dominates.  Second,  the  WSA 
estimator  performs  very  poorly.  This  latter  condition 
occurs  because  of  the  low-pass  filtering  nature  of  the 
algorithm.  The  signal  s  in  this  case  is  a  sample  function 
from  a  pseudorandom  Gaussian  process.  Conse¬ 
quently,  a  low-pass  filtered  version  deviates  more  from 
the  true  signal  than  the  noisy  measurement  itself  does. 
Note  also  that  the  other  estimates  perform  more  or  less 
equally  well. 

Turning  next  to  Table  II,  where  s  is  a  Rayleigh-dis¬ 
tributed  random  variable,  note  that  the  MMSE  esti¬ 
mator  again  has  a  slight  edge  over  the  other  estimators 
in  most  cases.  As  before,  the  WSA  estimator  performs 
very  badly,  again  due  to  the  nature  of  s  and  the  effect 
of  low-pass  filtering.  Also  the  remaining  estimators 
again  perform  about  equally. 

Table  m  shows  quite  different  behavior  on  the  part 
of  the  WSA  estimator.  This  is  primarily  due  once  again 
to  the  nature  of  the  signal.  The  particular  sample 
function  used  as  a  signal  for  this  case  is  shown  in  Fig. 
7(a)  and  is  seen  to  have  very  little  high-frequency  con¬ 
tent  Thus  a  low-pass  filtering  operation  is  ideal  for 
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eliminating  noise  with  a  large  high-spatial  frequency 
component.  The  other  estimators  performed  relatively 
well  in  most  cases  here  also. 

The  last  case  considered  is  the  high-contrast  case, 
with  s  shown  in  Fig.  7(b),  and  the  results  of  the  simu¬ 
lations  given  in  Table  IV.  Due  to  the  large  step  si2e  in 
this  signal,  there  is  a  sizable  high-spatial  frequency 
component  Because  of  this,  the  superior  performance 
exhibited  by  the  WSA  process  in  the  similar  case  above 
is  lost  here.  The  various  optimal  estimators  perform 
rather  poorly  as  well  due  to  drastic  deviation  from  the 
normality  assumption.  It  is  the  spatially  adaptive  es¬ 
timators  which  performed  well  in  this  case,  most  notably 
the  JS  estimator. 

To  aid  interpretation  of  the  data  presented  in  Tables 
I-IV.  consider  the  pictorial  representations.  Figure  8 
illustrates  the  signals  of  Fig.  7  after  corruption  by  equal 
parts  signal-dependent  and  signal-independent  noise. 
These  corrupted  waveforms  are  then  used  as  noisy 
measurements,  and  three  of  the  estimators  are  applied. 
The  MMSE  estimate  is  illustrated  in  Fig.  9,  the  WSA 
estimate  in  Fig.  10  (where  the  low-pass  filtering  effect 
is  evident),  and  the  JS  estimate  is  shown  in  Fig.  11. 
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Fig.  10  WSA  estimates  for  the  noisy  measurements  of  Fi|.  8. 
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Fig.  11.  James-Sieui  estimates  for  the  noisy  measurements  of 

Fig.  8. 


VI.  Conclusions 

Overall,  then,  using  the  MSEE  as  a  performance 
measure,  several  observations  may  be  made.  If  the 
original  signal  has  a  great  deal  of  high-frequency  con¬ 
tent,  as  does  an  image  with  a  lot  of  fine  detail,  the  WSA 
estimator  should  be  avoided.  The  MMSE  estimator 
is  the  most  desirable  when  the  signal  statistics  do  not 
deviate  too  far  from  normality,  with  the  JS  estimator 
becoming  the  better  choice  in  cases  of  large  deviations 
from  the  normality  assumption.  On  the  other  hand,  if 
the  signal  is  known  (or  suspected)  to  have  little  high- 
frequency  content,  the  superior  estimator  is  the  WSA 
algorithm,  especially  when  substantial  deviations  from 
the  Gaussian  assumption  are  also  encountered. 
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Abstract 

A  method  is  given  Cor  matching  a  subpattem  of  a  two-dimensional 
image  against  a  stored  prototype,  where  the  latter  is  defined  on  a 
window  whose  position  and  shape  is  determined  by  the  action  of  a  Lie 
group  of  transformations.  The  method  involves  the  construction  of  a 
path  in  the  control  group  along  which  the  matching  error  decreases 
to  a  local  minimum. 


1.  INTRODUCTION 

A  problem  of  classical  interest  in  pattern 
recognition  is  that  of  determining  the 
presence  or  absence  of  a  particular  sub¬ 
pattem  or  subpattem  class.  In  the  anal¬ 
ysis  of  two-dimensional  imagery  this  can 
take  the  form  of  detection  of  comers  and 
edges  or  the  location  of  a  specific  sil¬ 
houette.  More  particularly,  we  may  be  in¬ 
terested  in  obtaining  an  exact  match  of  a 
specific  portion  of  the  image  to  a  sub¬ 
image,  often  a  prototype,  which  may  appear 
in  an  arbitrary  manner,  varying  in  size, 
location  and  orientation.  This  is  the 
problem  which  is  herein  addressed. 

A  related  question  was  considered  by  Dir- 
ilten  and  Newman  [3]  where  it  was  shown 


how  two  planar  images  could  be  matched  un¬ 
der  arbitrary  affine  transformation  of  the 
plane,  if  a  match  were  at  all  possible.  In 
addition  to  affine  transformations,  an  al¬ 
lowance  was  also  made  for  dilation  of  in¬ 
tensity  scale  such  as  that  which  results 
from  under  or  over  exposure  of  film  within 
latitude  limits.  The  results  cited,  how¬ 
ever,  are  of  little  use  in  matching  sub pat¬ 
terns  ,  since  the  algorithms  are  highly  sen¬ 
sitive  to  the  background  context.  Never¬ 
theless,  the  utility  of  a  group  theoretic 
approach  to  pattern  matching  was  clearly 
demonstrated.  * 

In  the  following  we  present  a  method  for 
performing  a  local  search  for  an  imbedded 
subpattem  of  a  two-dimensional  image.  The 


•This  research  was  supportsd  by  the  Army  Research  Office,  Contract 
DAAG29-30-C-0037  and  by  the  Office  of  Naval  Research,  Contract 
M0014-76-C-1136 . 
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method  is  one  involving  adaptive  control 

of  a  retina  which  seeks  the  desired  sub- 

pattern  by  evolving  along  a  curve  in  the 

space  of  parameters  in  a  direction  which 

assures  improvement  in  the  goodness  of 

fit. 

-  • 

2.  BACKGROUND 

Let  G  be  a  Lie  group  of  transformation  on 
an  analytic  manifold  M.  Suppose  G  has  di¬ 
mension  n  while  M  has  dimension  m.  Let  x 
and  y  denote  the  coordinates  of  elements  f 
and  g  in  G,  respectively,  in  a  patch  con¬ 
taining  the  identity  element  e  of  G.  Also, 
let  p  denote  coordinates  of  an  element  u 
of  M  in  some  patch  in  M.  We  may  then  ex¬ 
press  the  coordinates  i  of  the  product 
h  *  fg  and  the  coordinates  q  of  the  ele¬ 
ment  v  «  gu,  relative  to  suitable  patches, 
by  means  of  analytic  functions 

r  -  J(x,y)  (2.1) 

q  -  K  (y,p)  (2.2) 

K  and  J  are  vector-valued,  having  values 
in  n-dimensional  space  Rn  or  Cn  and  m- 
dimensional  space  Rm  or  c”.  Hereafter  we 
shall  assume  that  these  underlying  spaces 
are  real.  We  denote  the  ith  component  of 
J  by  and  the  jth  component  of  K  by  K^. 

In  order  to  define  the  Lie  algebra  of  G  we 
first  introduce  real-valued  maps  on  G  by 

Pij(X)  *  5yJ<X,y'  1y«e'  (2,3> 

where  i  and  j  each  range  from  1  to  n.  The 
cross-section  p*y  which  consists  of  the 
as  i  ranges  from  1  to  n,  and  j  is  fix¬ 
ed,  may  be  thought  of  as  a  vector  field  in 
Rn.  Such  a  vector  field  attaches  to  a 
point  x  the  vector  P.^tx).  As  such, 
P#2,...,P,n  form  a  basis  for  the  tangent 
space  at  the  point  x  [1,2].  The  infinite¬ 
simal  transformations  of  G  may  now  be  de¬ 
fined  by 

xj  "  JiVx)^  '  <2'4) 


for  j  «  1,2, ... ,n. 


The  differential  operators  so  defined  are 
to  be  considered  as  linear  operators  on  the 
space  of  analytic  functions  on  G,  or,  more 
generally,  on  the  space  of  differentiable 
functions  on  G.  The  Lie  algebra  of  G  is 
simply  the  n-dimensional  vector  space  con- 

I 

sisting  of  all  linear  combinations  of  these 
operators,  and  will  be  denoted  by  L(G)  [2]. 


The  Lie  algebra  of  G  may  also  be  defined  in 
terms  of  its  actions  on  the  manifold  M. 
Analogous  to  (2.3)  we  define 
3K 

^y'PHy-e  (2-5> 

for  a  -  1,2, ...,m  and  j  «  1,2, ...,n.  Pin- 
ally,  as  in  (2.4)  above  we  set 


(2.6) 


The  operators  X£ ,  Xj ,  . . . ,  X^  apply  to 
functions  defined  on  M  and  span  a  Lie  alge¬ 
bra  isomorphic  to  L (G) . 


The  following  result  from  [4]  will  be  used 
later,  and  is  stated  for  reference: 


Theorem  2.1.  Let  f:  M  -  R  be  differenti- 
albe  and  define  F:  G  *  M  -  R,  in  terms  of 
coordinates ,  by 

F(x,p)  -  f (K(x,p) ) .  (2.7) 

Then  for  each  j  *  1,2, ...,n  we  have 

X^  -  X^F.  (2.8) 


Let  us  consider  a  curve  t  -  g(t)  in  G  sat¬ 
isfying  g(0)  *  e.  In  terms  of  a  coordinate 
patch  at  e,  git)  may  be  described  by  a 
curve  x(t)  in  Rn  satisfying  x(0)  »  0.  we 
shall  consider  the  case  in  which  x(t)  is 
given  as  the  solution  of  an  evolution  equa¬ 
tion  of  the  form 
n 

x  ( t )  •  l  A.  ( t)  P,i  (x  (t)  )  ,  x  (0)  -  0  ,  (2.9) 
i«l  1 

where  P,1,...P,n  are  cross-sections  of  the 

array  of  functions  given  by  (2.3),  and 

A, (t) , . . .  ,  A_  (t)  are  suitable  control  func- 
l  n 

tions. 
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Now  let  ?  denote  the  coordinates  of  a 
point  u  in  some  coordinate  patch,  for  a 
differentiable  map  f:  M  -  R  we  may  define 
H:  R  *  M  -  R  by  setting 

H ( t , p)  ■  f(g(t)u).  (2.10) 

He  recognize  that  H(t,p)  -  F(x(t) ,p)  where 
F  is  the  extension  of  f  to  G  *  M  as  in 
Theorem  2.1  above.  From  the  point  of  view 
of  application,  if  we  regard  f:  M  -  R  as 
an  image,  then  H(t,p)  represents  the  mov¬ 
ing  image  obtained  by  translation  due  to 
the  curve  g(t).  Also  from  [4],  we  have 

Theorem  2.2.  In  the  context  above , 

if  ■  ?  X  (t)X!H.  (2.11) 

at  Lml  i  i 

3.  THE  CONTROL  MODEL 

3y  an  image  we  mean  a  map  f:  M  -  R,  where 
the  value  f(p)  at  a  point  p  fi  M  represents 
the  gray  value  at  the  picture  element  at 
p.  In  practice,  values  are  observed  on  a 
subset  w  m,  which  we  regard  as  a  window 
which  may  be  translated  by  the  action  of  G 
on  M.  Thus,  upon  translation  by  an  ele¬ 
ment  x  e  G,  the  value  observed  at  p  S  W  is 
given  by  F(x,p)  -  f(K(x,p)),  as  in  (2.7) 
above. 

We  consider  a  given  prototype  3ub-image  V 
defined  on  the  window  w,  V:  W  -  R.  The 
problem  then  is  to  determine  x  S  G  such 
that  F(x,p)  »  V (p)  for  all  p  S  w,  or  deter¬ 
mine  that  no  such  x  exists.  As  a  matter 
of  practice,  we  seek  x  e  G  which  minimizes 
the  objective  function 

Y(x>  -j  |  (F(x,p)  -  V(p))2dp,  (3.1) 

W 

where  dp  represents  a  volume  element  and 
the  integral  is  over  the  window  w,  which 
is  assumed  to  be  of  bounded  volume. 

In  general,  for  any  two  functions  fj^fj: 
w  -  M  we  define 

<f  1 , f 2>  ■  |fj_fjdp  and 
W 

! \:v  !  -  <*i'V1/2  ■ 


Thus,  Y(x)  *  |  |F  -  V | | 2/2 ,  where  x  is  re¬ 
garded  as  a  parameter. 

The  following  is  a  well-known  property  of 
the  Lie  group  G  [2] : 

Lemma  1.  In  order  that  the  differential 
d  (x)  -  0  at  a  point  x  S  G,  it  is  necessary 
and  sufficient  that  eact^X^Y(x)  »  0  where 
X^Xj,  —  ,Xn  are  the  generators  of  L(G) 
given  by  (2.4)  . 

By  direct  calculation,  we  obtain  X^Y(x)  » 

/ ( F ( x , p)  -  V(p))X,F(x,?)dp.  In  practice, 

W 

this  expression  is  difficult  to  compute 
numerically,  due  to  the  presence  of  the 
term  X^F,  which  cannot  be  computed  directly 
from  observed  data.  However,  by  Theorem 
(2.1)  we  have  X^F  «  X|F,  and  the  latter  can 
be  calculated  from  a  single  value  of  x. 

Suppose  now  that  a  curve  in  G  is  given  by 
coordinates  x(t)  obtained  as  a  solution  of 
Equation  (2.9).  We  seek  to  find  \(t)  » 

C Ct)  , . . . ,  A n f t ) )  so  that  4i(t)  *  f(x(t) ) 
decreases  to  a  minimum  value.  Defining 
H (t,p)  *  F(x(t) ,p)  we  obtain, 

ice)  -  /( H(t,p)  -  V(p))|5  (t,p)dp  (3.2) 
W 

which,  by  application  of  Theorem  (2.2)  ,  be¬ 
comes 

.  n  r 

<l>  (t)  -  l  A  .  (t)  (H(z,p)  -  V(p))X,’H(t,p)dp 

i-1  l 

n  <3'3) 

-  I  A.  (t)  <H  -  V,X.’H> 
i-1  1  1 

Upon  observing  that  <H  -  V,X!H>  * 

<F  -  V,X^F>  ■  X^Y  at  x  ■  x(t) ,  we  deduce: 

Theorem  3.1.  If  A^(t)  is  chosen  so  that 
sgnA^tt)  *  -  sgn  <H  -  V,X|H>,  we  have.. 

i(t)  i  0  for  all  t,  with  equality  at  ’  *0 
if  and  only  if  dY  *  0  at  x  ■  x ( t Q ) . 

Among  the  class  of  bounded  controls, 

|Ai  -  (t) [  s  1,  we  see  that  the  rate  of  de¬ 
crease  of  (t)  is  maximized  by  the  choice 
Ai(t)  ■  -  sgn  <K  -  v , x|H> ,  (3.4) 
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for  i.  *  1,2, —  ,n.  Of  course,  other  stra¬ 
tegies  can  be  formulated,  including  steep¬ 
est  descent,  and  some  methods  using  un¬ 
bounded  controls.  By  proceeding  along 
trajectories  defined  by  the  solution  of 
(2.9)  with  A(t)  given  by  (3.4),  we  ap¬ 
proach  a  critical  point  of  T  (i.e.  d  -  0). 
Since  maxima  and  saddle  points  are  un¬ 
stable  under  perturbation,  in  practice 
this  extreme  point  will  always  be  a  mini¬ 
mum. 

4 .  SIMULATION  RESULTS 

The  results  discussed  in  the  previous  sec¬ 
tion  have  been  implemented  by  a  discrete 
algorithm  and  tested  on  simulated  data  IS]. 
A  digitized  two-dimensional  image  was 
first  generated  in  the  form  of  a  large 
two-dimensional  array,  and  the  prototype 
was  generated  in  a  20  *  20  window  array. 

The  image  space  was  assumed  to  be  subject 

to  translation,  magnification  and  rotation. 

giving  rise  to  a  four  parameter  Lie  group 

2 

of  transformations  in  the  plane,  R  . 

A  number  of  cases  were  considered,  includ¬ 
ing  some  involving  multiple  (false)  tar¬ 
gets  and  others  in  which  the  prototype  was 
absent  from  the  image  being  searched.  In 
some  cases  the  image  was  contaminated  by 
5%  random  noise.  In  all  cases  the  search 
was  started  with  overlap  between  the  pro¬ 
totype  target  and  the  image  target. 

The  differential  equation  (2.9)  was  solved 
by  means  of  a  Runge-Kutta  fourth  order 
method,  with  a  dynamic  step  size,  which 
was  increased  as  necessary  to  accelerate 
convergence  and  decreased  as  necessary  to 
maintain  stability.  Integration  was  re¬ 
placed  by  summation,  although  we  conjec¬ 
ture  that  convergence  could  have  been  ac¬ 
celerated  by  the  use  of  a  trapezoid  rule. 

Generally,  search  times  ranged  from  30  to 
50  steps,  with  the  longer  search  tines 
prevailing  for  the  more  difficult  cases. 


In  all  cases ,  the  final  results  were  quite 
reasonable,  even  in  those  cases  where  the 
prototype  was  absent.  In  the  latter  cases, 
the  search  terminated  with  a  ’best"  match, 
with  a  coanensurately  large  final  error. 

As  an  example,  Figure  1  shows  that  starting 
position  for  a  noisy  image  containing  two 
objects.  The  prototype  is  indicated  by  the 
central  silhouette,  while  the  true  target 
is  shifted  upward,  slightly  to  the  right 
and  is  reduced  in  size.  A  false  target 
overlaps  the  lower  right  corner  of  the  pro¬ 
totype  . 


Fig.  1.  Initial  Window  Position. 

The  termination  conditions  are  shown  in 
Figure  2,  where  the  true  target  was  located 
after  49  steps.  All  parameters  ware  cor¬ 
rect  with  the  exception  of  magnification, 
which  was  about  5%  too  large.  Smaller  val¬ 
ues  of  magnification,  however,  increase  the 
error  due  to  the  presence  of  the  false  ob¬ 
ject,  which  is  barely  touching  the  bottom 
edge  of  the  window  in  Figure  2. 
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Fig.  2.  Terminal  Window  Position. 
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SUMMARY 

A  procedure  is  described  in  which  the  output  characteristics  of  an  integrated  circuit  are  optimized  with  respect  to  a  set  of 
variable  fabrication  parameters.  A  simple  RC  coupled  audio  amplifier  is  used  as  an  example.  The  gain-bandwidth 
product  is  obtained  as  a  function  of  oxidation  and  diffusion  times  and  temperatures,  and  the  optimization  is  performed 
by  way  of  a  line  search  using  these  variables  as  the  parameters  of  the  optimization.  The  values  established  for  the 
process  parameters  are  consistent  with  those  employed  for  conventional  fabrication,  and  desired  changes  in  per* 
formance  can  be  obtained,  in  general,  by  a  straightforward  readjustment  of  the  values  of  the  process  variables. 
Although  limited  by  certain  assumptions  and  a  relatively  primitive  circuit,  the  results  demonstrate  the  validity  of  the 
procedure. 


1.  INTRODUCTION 

The  manufacture  of  an  integrated  circuit  can  be  considered  as  a  three-stage  procedure.  Initially,  per¬ 
formance  requirements  are  provided  or  established  by  the  circuit  application.  These  requirements  suggest 
an  interconnection  of  passive  and  active  elements  whose  values  and  geometry  are  determined  during  the 
second  stage.  Finally,  a  suitable  process  is  chosen  to  fabricate  the  circuit  design  of  the  previous  stages.  Every 
fabrication  process  generating  the  necessary  impurity  profiles  is  controlled  by  a  number  of  independent 
variables  which  must  be  assigned  appropriate  values.  The  specification  of  these  values  is  an  integral  part  of 
the  design  and  constitutes  an  implicit  relation  between  the  performance  requirements  and  the  fabrication 
process.  In  practice,  the  process  is  usually  known  in  advance,  but  the  independent  variables  must  still  be 
specified. 

A  situation  which  often  occurs  is  one  where  a  circuit  is  desired  whose  output  characteristics  are  dose  and 
yet  not  identical  to  those  of  a  genetically  related  circuit  for  which  the  entire  three-stage  design  has  been 
completed.  What  is  often  done  in  these  cases  is  to  introduce  appropriate  changes  in  the  second  stage  of  the 
procedure  and  then  redesign  the  third.  Because  of  the  number  of  iterations  involved,  this  practice  turns  out 
to  be  quite  laborious  and  expensive.  In  addition,  more  often  than  not,  the  new  design  calls  for  a  new 
geometry,  and  the  creation  of  new  masks  adds  significantly  to  labour,  cost  and  delays. 

The  technique  described  here  introduces  optimal  changes  at  the  fabrication  stage,  several  steps  beyond 
the  circuit  design  stage.  A  given  process  model  is  incorporated  into  a  given  circuit  model  and  the  predicted 
output  characteristics  are  optimized  with  respect  to  a  specified  set  of  variable  fabrication  parameters.  The 
predicted  output  characteristics  can  be  quite  accurate  if  accurate  circuit  and  process  models  are  employed. 
There  is  certainly  an  abundance  of  reliable  circuit  models;  very  detailed  process  models  are  available  also, 
(see,  for  example.  Reference  1).  The  optimization  is  performed  by  way  of  an  index  which  measures  the 
difference  between  desired  and  predicted  performance  over  a  physically  prescribed  range  of  the  fabrication 
variables.  For  all  practical  purposes,  the  fabrication  of  the  new  circuit  is  obtained  without  any  delay  and  is 
essentially  cost-free.  Additional  constraints  can,  of  course,  be  imposed  to  resolve  questions  of  realizability, 
sensitivity,  thermal  variation,  and  so  on. 

The  technique  is  particularly  useful  for  those  integrated  circuits  which  are  designed  on  a  modular  basis.  In 
these  cases,  a  circuit  consisting  of  several  modules  is  desired,  where  most  or  all  of  the  modules  may  have 
been  previously  designed  independently.  The  designs  are  frequently  incompatible;  the  technique  would 
then  provide  a  unified  fabrication  design  using  the  existing  geometry  of  the  given  modules. 
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The  following  work  applies  the  technique  to  the  fabrication  of  an  audio  amplifier  which  might  itself  form 
the  modular  basis  for  an  operational  amplifier  array.  Although  simplified  circuit  and  process  models  are 
used,  the  results  indicate  the  feasibility  of  the  method,  which  is  the  objective  of  this  paper. 

2.  CIRCUIT  MODEL 

The  amplifier  circuit,  which  incorporates  an  n-p-n  transistor,  is  illustrated  in  Figure  la.  The  specified 
output  function  is  the  voltage  gain,  |A(<t>)|,  which  depends  on  the  circuit  variables  shown^n  Figure  lb;  these, 
excepting  the  emitter  bypass  and  coupling  capacitors,  depend  on  impurity  concentration  profiles 
established  by  the  process.  Because  the  emitter  bypass  capacitance  is  usually  quite  large,  it  was  convenient 
to  make  it  a  fixed  external  capacitor.  The  values  of  Ci  and  C2  are  very  large  and  do  not  enter  the 
calculations. 


(0) 


Figure  1.  (a)  Audio  amplifier  circuit;  (b)  equivalent  circuit  of  the  audio  amplifier 

The  resistances  rbb  and  r„ .  are  obtained  from  the  fixed  dimensions  of  the  base  and  emitter  regions  and  the 
base  sheet  resistance.  Distributed  resistances  R,,  Rj,  R,  and  Re  are  formed  during  the  base  diffusion,  and 
also  depend  on  the  base  sheet  resistance  ;  their  dimensions  are  fixed.  The  Miller  capacitance,  CM,  includes 
the  depletion  capacitances  of  the  reverse-biased  base -collector  and  forward-biased  base-emitter  junctions 
(charge  storage  in  the  base  is  neglected). 

The  average  impurity  concentrations  in  the  emitter  and  base  regions  (and  the  fixed  uniform  collector 
concentration)  determine  the  junction  capacitances.  These  averages,  and  the  base  sheet  resistance  (which 
includes  a  constant  hole  mobility  term),  require  the  emitter  and  base  impurity  profiles  and  the  junction 
depths  in  their  calculation;  the  latter  are  obtained  by  equating  emitter  and  base,  and  base  and  collector, 
concentrations.  The  current  gain,  is  calculated  from  the  average  concentrations,  the  distance  between 
the  junctions,  and  the  minority  carrier  lifetimes  and  diffusion  constants.  Although  these  latter  are 
complicated  functions  of  impurity  concentration,2  approximate  expressions  can  be  obtained. 


3.  PROCESS  MODEL 

Each  term  appearing  in  the  expression  for  the  gain  is  implicitly  related,  through  the  impurity  concentration 
profiles,  to  the  parameters  of  the  process.  The  concentration  profiles  are  the  solutions  of  the  diffusion 
equation, 

mxj)  a:,V(*,t) 
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Table  I.  Optimal  fabrication  parameters  for  specified  gain  in  the 
bandwidth  20  Hz  to  20  kHz.  Gain  is  in  dB,  time  (t)  is  in  seconds, 
and  temperature  ( T)  is  in  *C 


Gain 


I 

3 


no 


35 


30 


25 


20 


Parameter 

20 

25 

30 

35 

40 

3400 

7.140 

7400 

4,690 

6410 

T» 

1.100 

1,130 

1,170 

1400 

1400 

«i 

1400 

UOO 

1430 

1430 

1460 

T, 

1,190 

1.190 

1.190 

1,180 

1,180 

i. 

2,400 

2.400 

2.370 

2.310 

2470 

T. 

1.150 

1,150 

1,150 

1.150 

1.150 

h 

1.230 

930 

930 

870 

870 

T, 

1,100 

1,100 

1,090 

1,090 

1.090 

Table  11.  Optimal  fabrication  parameters  for  specified  gain  in  the 
bandwidth  10  kHz  to  40  kHz.  Cain  is  in  dB,  time  (t)  is  in  seconds. 


and  temperature  (T)  is  in  *C 

Cain 

Parameter 

30 

35 

40 

'» 

5.800 

6,330 

4.140 

7» 

1,160 

1400 

1400 

'i 

1,150 

1460 

1.150 

T , 

1,180 

1,190 

1.180 

». 

2470 

2,340 

2470 

r. 

1.150 

1,160 

1,150 

'j 

880 

630 

940 

t2 

1,090 

1.090 

1,090 

i 
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Figure  2.  Specified  fdashed  lines)  and  optimally  obtained  (solid  lines)  gain  versus  frequency  in  the  range  20  Hz  to  20  kHz 
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assuming  one-dimensional  diffusion  and  a  diffusion  coefficient  which  depends  on  temperature  only.  The 
process  steps  establish  the  boundary  conditions  on  the  solutions.  A  typical  process  sequence  was  chosen  in 
this  work,  consisting  of  a  base  diffusion  blocking  oxidation;  a  p- type  base  deposition  followed  by  a  base 
drive-in  diffusion;  an  emitter  diffusion  blocking  oxidation;  an  a -type  emitter  diffusion;  and  a  pre- 
metalization  oxidation.  The  bias  resistors  are  derived  from  the  base  diffusion.  Photolithography  and  etching 
were  assumed  to  take  place  wherever  required;  metalization  is  not  considered  to  affect  the  impurity 
profiles.  The  circuit  variables  then  depend  on  the  times  and  temperatures  of  the  process  sequence.  These 
times  and  temperatures  are  the  independent  parameters  of  the.optimization;  quantities  which  do  not 
depend  on  them  are  considered  constant.  Eight  significant  process  parameters  were  identified;  these  are 

rb,  the  base  diffusion  time; 

Tb,  the  base  diffusion  temperature; 

r,,  the  emitter  diffusion  blocking  oxide  growth  time; 

Tu  the  emitter  diffusion  blocking  oxide  growth  temperature; 
tc,  the  emitter  diffusion  time; 

T.,  the  emitter  diffusion  temperature; 

t2,  the  pre-metalization  oxidation  time;  and 

Ti,  the  pre-metalization  oxidation  temperature. 

These  parameters  were  optimized  assuming  a  fixed  set  of  mask  dimensions  to  obtain  the  desired  frequency 
response,  |A(<u)|. 


4.  OPTIMIZATION 

The  gain,  |  A(<u)|,  was  calculated  for  an  inital  point,  that  is,  for  an  8-tuple  of  the  independent  parameters,  fb, 
Tb.  Ci,  Tu  f.,  T„  t2,  T2.  |A(<w)|  was  then  compared  with  the  desired  gain,  A0,  by  way  of  the  integral 

/*{  CAo-lAfwlOdw 

where  w2  -  oj,  is  the  desired  bandwidth.  J  is  minimized  using  a  simple  line  search,  that  is,  by  varying  each 
independent  parameter  while  holding  the  others  constant.  In  order  to  ensure  reasonable  values  for  the 
optimal  process  variables,  these  were  constrained;  the  maximum  and  minimum  values  permitted  for 
temperature  were  1,200°C  and  900“C,  respectively,  and  for  time,  7200  s  (2  h)  and  600  s  (10  min), 
respectively. 

The  optimization  was  performed  first  for  several  gains  in  the  bandwidth  20  Hz  to  20  kHz,  and  again  for 
the  higher  gains  in  the  higher  frequency  range,  10  kHz  to  40  kHz.  The  integral  was  evaluated  using  the 
trapezoidal  rule.  1.000  Hz  increments  were  used  throughout  the  calculation  for  the  higher  bandwidth;  for 
the  lower,  1,000  Hz  increments  were  used  above  100  Hz,  and  20  Hz  increments  below  100  Hz.  Tempera¬ 
ture  and  time  were  varied  in  2®C  and  30  s  increments,  respectively. 

5.  RESULTS  AND  DISCUSSION 

The  results  of  the  optimization,  obtained  with  two  to  three  iterations,  are  tabulated  in  Tables  I  and  11.  The 
values  which  were  obtained  for  the  elements  of  the  equivalent  circuit  weTe  quite  reasonable  in  terms  of  a 
conventional  circuit  design;  however,  these  values  are  not  particularly  important,  because  they  are  only  the 
results  of  the  optimization,  and  do  not  participate  in  it.  Even  atypical  values  would  not  be  significant,  as  long 
as  they  could  be  obtained  within  the  constraints  set  on  the  process  times  and  temperatures. 

Figures  2  and  3  illustrate  the  behaviour  of  the  optimal  gains  versus  frequency  in  the  two  frequency  ranges. 
Changes  which  match  moderate  frequency  requirements  are  obtained  easily  by  an  optimal  readjustment  of 
the  values  of  the  fabrication  parameters.  As  requirements  become  more  severe,  as  at  the  higher  frequencies 
of  Figure  3,  deviation  of  circuit  gain  from  desired  values  increases,  finally  becoming  so  targe  that  a  redesign 
at  circuit  level  is  unavoidable. 
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Figure  3.  Specified  (dashed  lines)  and  optimally  obtained  (solid  lines)  gain  versus  frequency  in  the  bandwidth  10  kHz  to  40  kHz 

There  are  two  major  limitations  on  the  accuracy  of  the  calculation.  First,  the  process  model  suffers  from 
certain  deficiencies  because  of  several  assumptions  which  were  made.  For  example,  the  emitter  blocking 
oxide  must  be  grown  thick  enough  to  be  effective.  For  the  tabulated  values  of  h  and  Tt,  the  oxide  thickness 
is  around  2,000  A,3  which  is  not  thick  enough  to  block  the  required  40  min  phosphorous  emitter  diffusion.3 
However,  a  thickness  constraint  can  be  introduced,  although  it  was  not  done  here,  to  assure  appropriate 
oxide  thickness.  Also,  the  line  search  which  was  used  to  obtain  the  optimum  does  not  guarantee  a  global 
minimum.  It  is  important  to  emphasize  again  that  these  results  are  not  intended  to  establish  a  practical 
process,  but  to  demonstrate  the  use  and  feasibility  of  the  technique. 
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Abstract  ... 

Three  algorithms  for  the  solution  of  the  eigenvalue  problem  for 
a  continuous  parameterized  family  of  sparse  matrices  are  presented; 
a  continuous  LU  (or  LR)  algorithm,  a  continuous  QR  algorithm,  and  a 
continuous  Hessenberg  algorithm.  Each  of  .the  three  algorithms  may  be 
implemented  recursively  and  the  sparsity  of  the  given  matrices  is  _  . 
preserved  throughout  the  numerical  process. 
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